Gene Haur_4277 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4277 
Symbol 
ID5736136 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5460973 
End bp5462634 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content49% 
IMG OID641281437 
Producthypothetical protein 
Protein accessionYP_001547037 
Protein GI159900790 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0190094 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGTTT CAGTTAAAGC CATTCGAGGG TTGATTATTG CGGTCAGCGT ATGTGGCCTA 
CTTTTGCGGC TGATTTTTGC GCTCTTGCCC TTGCAAACCC ATTTGCTGGT GCTCGAAGAT
GATGCTTGGA TGGTGACGGC CATTGCCCGC AATTGGGCCA TGGGCCAAGG GATCACCGCC
GATGGTATCA CGCCAACCAA TGGATTTCAT CCGGTGTATC CACTCACTTT AGGGGCGATT
CCCTATGTTT TTGCCCCCGA TAACTTAGCC TTAGGCTTTA CTGTCAACTT GATTATCTGT
GCATTGTTGG CCAGTTTAGT CATGTGGCCG TTGTGGCATT TGCTGAGACA TTTGATGAAC
TGGCAAGCAA GTTTATTTGG CATAATCTTA TATGCGCTAA ACCCCGTTTT AGTGCGGTTT
ACCGTCAATG GCATGGAAAC GTCGATGGCT TTGTTGCTCT GGGTCACGAC GCTGCTAGCT
GCGGTCAAAA TCGATCCCAA AAAATTAAGC CATAATCTCG GCCTTGCCGC ACTAACGGCT
GCCATGATTC TGACCCGCCT AGATGGAGCA TTGCTCTTTG CCTCGATTGC CGCGGCTCGC
TTGATTTGGG CTTGGCGAGC CAAACGCTTA GGCCGTGAAT TGCCCATGCT CACGAGCTAT
GTCGTGGTGA CGTTTACGCT GTTAGTGCCC TATTTCTGGC GTAATTTGAC GGTATTTGGT
TCGTTTTCGC CCAGTAGTGG CAAAGCCTTG ACCTACTTGC ACAGCTACGT CAACTCCTAC
GCCATCTCAA ATGGGCTTGA TGGTTGGTAC GTTAATAGCG CAATTTCGAT GGAAGTATTG
GGACGCTCAG TAGTTGGCGC AGCGCTTTGT TTGGCCATTT TTGCGGCATT TGTGGGCTTT
TGGGTTGGTC GTCAACTCTG GCTTGGCTTA CCGCTCTTGC TCTATCTGCC GATTCCCTTG
GTTTATTATG GCTATATGAT GCAGCAGGAT AATCCACGTC ACTTTGTGCC TTGGTCGCTG
GCAGTCATTA TTTTGCTGGC ATGGGCGTTG GCGGCAATGC TCCAGCGTTT GCCATCGATC
AGCTATCTGG CCGTGCCAGC GCTGATTGCG GGCGTGCTGA TTGTGCAAAC CCTCGATAGC
TCACGCTTTT GGCAAGAAAA AGCAACAGCG CCTAGTCAAT CGCAACCAAC GATGTATCAA
GCAGCCTTGT GGATGCGCGA TAATTTGCCC AGTGATGCCT TGATCGGGGC TAAAAATTCG
GGCATTTATC AATATTATTC TGGTCATCAT GTGCTGAATA TCGATGGCAA ATTGAACAAC
GACATTCTTG AGGTCTATGA TCAACGGCGC ATGCTCGATT ATTTGCGCGA AAAAGGCGTG
ACCCACTTGA TCGATCAAGA GGGAACCATG GCCGATCATA TTCAGTTTTA TAGCTATCAA
TTTGGTGAAC GGCCCGAGCA TCGTGTGCCC ACAACCTTCA CCCAATTCAA GATCTATGGT
CAATTATTGC TGAGCAGTTT GGGGCTAGCC GATAAGCCAG CGCTTGATCG GCGCGATGGT
TTTGAGCCAA ATCAGCCATT TAGCAGCATC ACCACGGTGA TTCAGCGCTT CCCACGGCCA
AACGATAGCA ATAACCCAAT TGCGATTTTT GAACTTAACT AA
 
Protein sequence
MKVSVKAIRG LIIAVSVCGL LLRLIFALLP LQTHLLVLED DAWMVTAIAR NWAMGQGITA 
DGITPTNGFH PVYPLTLGAI PYVFAPDNLA LGFTVNLIIC ALLASLVMWP LWHLLRHLMN
WQASLFGIIL YALNPVLVRF TVNGMETSMA LLLWVTTLLA AVKIDPKKLS HNLGLAALTA
AMILTRLDGA LLFASIAAAR LIWAWRAKRL GRELPMLTSY VVVTFTLLVP YFWRNLTVFG
SFSPSSGKAL TYLHSYVNSY AISNGLDGWY VNSAISMEVL GRSVVGAALC LAIFAAFVGF
WVGRQLWLGL PLLLYLPIPL VYYGYMMQQD NPRHFVPWSL AVIILLAWAL AAMLQRLPSI
SYLAVPALIA GVLIVQTLDS SRFWQEKATA PSQSQPTMYQ AALWMRDNLP SDALIGAKNS
GIYQYYSGHH VLNIDGKLNN DILEVYDQRR MLDYLREKGV THLIDQEGTM ADHIQFYSYQ
FGERPEHRVP TTFTQFKIYG QLLLSSLGLA DKPALDRRDG FEPNQPFSSI TTVIQRFPRP
NDSNNPIAIF ELN