Gene Haur_2777 details


Gene Information

Locus tag: Haur_2777
Symbol:
ID: 5734658
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3535022
End bp: 3536308
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 52%
IMG OID: 641279920
Product: hypothetical protein
Protein accession: YP_001545543
Protein GI: 159899296
COG category:
COG ID:
TIGRFAM ID:
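
The coordinates and lengths listed above can be checked against each other. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are inclusive 1-based coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; neither assumption is stated in the record. The function names are illustrative.

```python
# Values copied from the Gene Information fields above.
START_BP = 3535022
END_BP = 3536308
GENE_LENGTH_BP = 1287
PROTEIN_LENGTH_AA = 428


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3536308 - 3535022 + 1 gives 1287 bp, and 1287 / 3 - 1 gives 428 aa, matching the listed values.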


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGATA ATTTGAGCCA CATGATTATT GATGCATGGC AAAACGGGGC CAGCCTCGAA 
ACACTCTGCC AGCAGTATCC TCACCACGCT GAGGCTATCC AACAATTAAT TAATCCATTA
ATTCAGTTAC AACGGGTCAA TCCACCAACT ATGCCAGCAC GAGCAAGCCA TGCACAAGCT
GATTTTATGC GTTTAGCGCA ACACTATCGC GCTCAAACTG CCCCAAAGCC TAAACCACGA
CGGCGCTTAT TAACTCAACG CTGGGTTTGG GCCACAGCGA CAATCCTGCT CTTGGTTTGT
TTGAGTGGCA ATTTGGTCTT ATCGGCCTCG GCTAGCGCCT TGCCAGGCGA TAGTTTGTAT
GGGATCAAAC GTTGGAGCGA ATCGATCAGC TTGGTGTTTA CGCCCAGCGC TGAACAATTA
ACTGCACGGA TCGATCTGGT CAACGAACGC CAGCATGAGA TCGCTAGCTT GGTAGCATTA
AATAAACCAG TGCCAAGCGA ATTGCTTGAT GAAGTCGTCA ACGAAACTCA GTCAATTGAG
TTGGCGCTTG CGCCGACCCA TACGAATGAC CCACGACGGA GCAAATTGAG CCAAGTCAAT
CAGCAACTGC AAACGACAAT TGCAATTATT CCAGTGGAAA ATGCGACTGA CAATCTCAAA
CATCATGATC TGATTGAGAC GCTTGATCAA AGCCGTCAGC GCATCGATAT TGCCAATCTG
ACGACTATTC CGACCCAACC AGCGAAGGTT TTGGTTGGTC CAACAGCAAC CAAGCAGCCA
AGTATCGCGC CAGCGGCAAC TGATCTGCCA CATATTGCAA ACCCAAAGCT CCCGCCAAAG
CCACATACAC CAACGCAGGA GCCAACCGAA GTCGTTTTAC CAACCTCAAC GAGCACACCA
TGGCCAACCG CGAAACCAAC GCGGGTGCCG CCGCCTGTGA TCAAACCAAC TGCCACTGCG
ACTTCGCTAC CCACATCAGC CCCAACCGAT GTGCCAATGC CCACATCAGC CCCAACCGAT
GTGCCAATGC CCACATCAGC CCCAACCGAT GTGCCAGTGC CCACCGAAAT TCCGAGCATC
GGGTTACCAA CCGCCACGCC AACGATTGTG CGTGAGCAAC CAACTGCGCC TGTCCCAACC
AAAGATCCTG GCGATATTAA GCCAACCGAA GTTCCACCAA CTTCACCACC AACTTCACCA
CCACCAACGA AGGAGCCACC GCCGCCTTCA CCAACGAAAG AGCCAGAGGA TACCAAAGGT
CAGCCAAACC CCATTCTCCA ATATTAA
 
Protein sequence
MNDNLSHMII DAWQNGASLE TLCQQYPHHA EAIQQLINPL IQLQRVNPPT MPARASHAQA 
DFMRLAQHYR AQTAPKPKPR RRLLTQRWVW ATATILLLVC LSGNLVLSAS ASALPGDSLY
GIKRWSESIS LVFTPSAEQL TARIDLVNER QHEIASLVAL NKPVPSELLD EVVNETQSIE
LALAPTHTND PRRSKLSQVN QQLQTTIAII PVENATDNLK HHDLIETLDQ SRQRIDIANL
TTIPTQPAKV LVGPTATKQP SIAPAATDLP HIANPKLPPK PHTPTQEPTE VVLPTSTSTP
WPTAKPTRVP PPVIKPTATA TSLPTSAPTD VPMPTSAPTD VPMPTSAPTD VPVPTEIPSI
GLPTATPTIV REQPTAPVPT KDPGDIKPTE VPPTSPPTSP PPTKEPPPPS PTKEPEDTKG
QPNPILQY
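
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not part of the original record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in permitted start codons). The function name `translate` is illustrative, and only the first and last groups of the gene sequence above are used as input.

```python
# Codon table built from the conventional TCAG ordering. The amino acid
# assignments are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases and last 27 bases of the gene sequence above.
    print(translate("ATGAACGATA ATTTGAGCCA CATGATTATT"))
    print(translate("CAGCCAAACC CCATTCTCCA ATATTAA"))
```

The two fragments translate to MNDNLSHMII and QPNPILQY, which agree with the start and end of the protein sequence listed above. The final fragment is in frame because the preceding 1260 bases are a multiple of three.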