Gene Haur_3484 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3484 
Symbol 
ID5735345 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4386641 
End bp4387996 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content51% 
IMG OID641280631 
Productleucine-rich repeat-containing protein 
Protein accessionYP_001546248 
Protein GI159900001 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000424211 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACATGC TCGATCTGAA CCAAGTTGAT CAAGCCGCCA TGCTCAGCCT TGCCCGCAAC 
CCCCAAACCG ACCCACAACA ATTAATTGCG CTTACCGAGT GGCTTAAACT CCAACAAGGC
GCTGATTCGG CTCAGCCTAG CACGAGTTTC GCTTACCTCA AGAGCCAAGC ACCGCAAGGC
CCACTGGCAG TTATCTTAGA ACGCCGAACC TCGCCGCTAC TCGAAGCCTT GATTGCCAAC
CCCAATCTTC CGCCAATGCT GGCTTTAGAA TTTGCCGCCG ATGTGCCAGC TGCATTTTTT
GCCAATCCGG CATTGCCCAT ATGGTTGCAA CACGATCCTG CGTTGTTTAA GCGTATGGAA
CCACTACGCT GTATGCAGTT TTTAAGCTAT CCAGCTATTC CCCAAGTTAT TTTAGCGTCG
ATTCAAAGCA TTAGCCCTGA AATTGCTCAA ACCGTGCACC TGCATAGTGC TTCAAATCCT
CAGCTTGATG CCGATTGGTA TGCCGACTAT CAGCATTACA GAGAGCAAGT GGCCTTGCCC
GATGCCACAG CAACAACATT ACTGCAAGAA TTAATTGGCT TGAATGCGAT TAATCAGCCG
ATGTTGAGCT GGCTACGCCA ATCACCAGCC GAGCAGCATC AAGCCTTATT TAATCGAGCG
CCAGCTACGC CACAGCCTGT GATTGAGCCA CAAACGCTTA ACTTTACCCC AAACTATCCA
CGATTGCTCG AATCTCCGTT GGCCGAACGA ATTCAAGTAG CGCATTCCAA TGATATTAAG
GGCTTGGCAA TTTTGGCTGA AGACGACGAT CTTAGCATTC GACTGCTAGT GGCCCAAAAT
CCGGCAACTC CGCGGACTGT TCATCAACTA CTGGAGCTTG ATGATTCGCA GCATGTGCGA
GCGGCACTAG CGCGGAATCC CAACATTAGC CCAAAATTAC TGCTGACACT AGCGCGTGAT
TACACGTGGT CGGCAGTGCC AATTCGTGTG GCAGCAGCCC TCAATCCAGT CGCAACTTCC
GAAATTCTAG AGTTGCTGGC CCAAGATCAA GCCTCGTTGG TACGTCAAAC GGTTGGGCAA
AACCCGCAGG CCTCAGCTGA AATACTTGAT CATGCCCGCC AGCGAGCACT AATCGAAGCC
CTGTATGCGC TCGATCCCTG GCTGCATATG CTGGCATTGG GAAACCCCGC GACCCCAATT
GAGCATTTGG CGAAGGGCGC TCGCTCCCCG TGGTGGTTGG GGCGAGCAGC TTTAGCCGAA
AACCCGAGTT GCCCAAGCAA TGTGCTTGAG CAATTGACCA ATGACGGAAA TTGCTATGTG
CAACGCTTGG CACAAACTCA ATTGAATGCT CGTTAA
 
Protein sequence
MNMLDLNQVD QAAMLSLARN PQTDPQQLIA LTEWLKLQQG ADSAQPSTSF AYLKSQAPQG 
PLAVILERRT SPLLEALIAN PNLPPMLALE FAADVPAAFF ANPALPIWLQ HDPALFKRME
PLRCMQFLSY PAIPQVILAS IQSISPEIAQ TVHLHSASNP QLDADWYADY QHYREQVALP
DATATTLLQE LIGLNAINQP MLSWLRQSPA EQHQALFNRA PATPQPVIEP QTLNFTPNYP
RLLESPLAER IQVAHSNDIK GLAILAEDDD LSIRLLVAQN PATPRTVHQL LELDDSQHVR
AALARNPNIS PKLLLTLARD YTWSAVPIRV AAALNPVATS EILELLAQDQ ASLVRQTVGQ
NPQASAEILD HARQRALIEA LYALDPWLHM LALGNPATPI EHLAKGARSP WWLGRAALAE
NPSCPSNVLE QLTNDGNCYV QRLAQTQLNA R