Gene Haur_4519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4519 
Symbol 
ID5736370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5786045 
End bp5787319 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content51% 
IMG OID641281682 
Producthypothetical protein 
Protein accessionYP_001547279 
Protein GI159901032 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000498589 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAGCTG AAATCGTTCA AGCTCAGTAT GACCAACTCA CTCAAGTTGC AAGTCGGTTT 
GGCAAGCAAT CGGAGGTAGT TGATCAAATC AACAGCCAAG TTCGCCAGAG CTATGAAACA
TTAGCCAATG GCGGATGGAT GGGTGATGCG GCAAAGGCAT TTTTCAATGA AATGCAGACG
GAAATTTTCC CCACAATGCA ACGTCTAACA GGTGTATTGC GGGAAGCTCA AACGGTAACC
CAACAAATTA GTACGATCTT CCAGCAAGCT GAACAAGAGG CAGCTAAGGG GATCACATTT
GCTGATGGTG GAGCTAGCTC AAGTGCTGGC AGTGGTGTTT CGTTCAGCGC AGCAGCAGGT
AATGCGGCAG GCTCGGCGGC AGCGAACCAG TTGCCACCAC CACGCATGTA CATTGTCAAC
GGGATTAACG CCAGCGAGCC AGATGGTACT CCAGGTGAAG GGCCACAGCA ACTGGCCGGC
TTGTTGGCTG CCCACGGCTA TGATCCTAGC CAAATCAAGG CGATGCCGGC AATTTACAAC
ACCAACTACA CCACCAACTT GCAAGGCACC GATTTGCAAG GTACCAATCA TGGTGGTTGG
TTATCGCCCG TCGATTGGCT GACCGGAGCC GGAGCCTCGA TCGTGAATGG AGTTACGGGT
GCCGGTGCCA GTGTTGTCAA TGGGGCTTCG GCGTTGTTTA ATACTGGAGT CGGGGTCAGC
GAAGTTGTGC AAGAGTATAC AATGCAAGAT CAAGGCAAGT ATACCCAGGA AAGCTACAAC
TTTATTCAAC AAGACCTTGC CCGTAACCCA TTATTGCCCG GCCAAACCGT CATGCTAATT
GGGCATAGCG GCGGTGGGGC AGTTGTCAGT AACCTCGCGC CAATGCTTGA AAATAACATG
GGCGTTGATG TTTCTGGGGT GGTTACGCTC GGGTCGCCGG TAGCCAATGC TGATCGGGCG
ATGCAATATG CCAAATTCCT CAGCGTTAGC GACAAAGGCG ATTATATTGG CCAACCATGG
ATTCGCTCCG ATGAAGGGCG TAATTTCCTA ACTCCAGGCT TGATGACTGG CATTTTAGCG
CCGAAATCCT TGCCATTGGT TGTGCCAGGG GTGCTTGGAG CCGATAACGC CGCCCGCGAT
GCTGGGATCA ATTACTTTAC GACCAATGCC AATGCGGGCA ACCCAATTAG CAATCACAAC
TCCTATTGGA CGAGCAACGA TGTGGTTAGC ATCATTAAAA ACAGCTATCC CCAAGTTGCT
CCATACCTGA AGTAA
 
Protein sequence
MGAEIVQAQY DQLTQVASRF GKQSEVVDQI NSQVRQSYET LANGGWMGDA AKAFFNEMQT 
EIFPTMQRLT GVLREAQTVT QQISTIFQQA EQEAAKGITF ADGGASSSAG SGVSFSAAAG
NAAGSAAANQ LPPPRMYIVN GINASEPDGT PGEGPQQLAG LLAAHGYDPS QIKAMPAIYN
TNYTTNLQGT DLQGTNHGGW LSPVDWLTGA GASIVNGVTG AGASVVNGAS ALFNTGVGVS
EVVQEYTMQD QGKYTQESYN FIQQDLARNP LLPGQTVMLI GHSGGGAVVS NLAPMLENNM
GVDVSGVVTL GSPVANADRA MQYAKFLSVS DKGDYIGQPW IRSDEGRNFL TPGLMTGILA
PKSLPLVVPG VLGADNAARD AGINYFTTNA NAGNPISNHN SYWTSNDVVS IIKNSYPQVA
PYLK