Gene Haur_3912 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3912 
Symbol 
ID5735773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4905573 
End bp4906796 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content49% 
IMG OID641281063 
Producthypothetical protein 
Protein accessionYP_001546674 
Protein GI159900427 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTAACG AAATCGTCCA AATCGATTAT CAGGAGGCCA ATCAGCTAGC CCAACGTTTC 
TCGAAGCAAC AAACCCACGT GCAGGCAATT TTGCAAAGCC TCAGGCAGAC CATTCAAGCG
CTTGAGAATG GCGGCTGGAT GGGCGATGCA GCAACCGCAT GTTTCAAAGA ATTTCATGCC
GAGGTAGTCC CTGCCTATAC CCGACTCAGT AACGTCCTTG GTGAAGGCCA AAGTACCCTG
CAAGGTATTG CAACGCTCTT TCGTCAGGCC GAAGAAGAAG CTGCCGCACT CTTCCGTGGC
GAGTTTGCCG CCGCCGCTGG GGGTGAAAGT GGTATCGCCC AAGCCGTCGG TGGGAGTGGT
GGCAACATTC AAGCCGACAA TGGTGGTATG TATCAAACAG TTCAGGCATT TGCCACTAAT
TTAAGTGGTA GCGGTGGTGG ACAAGGCCAA GCAAGCGAGC TAGAAACCCT CATCCGTTCG
CTCAAAGATC AATGTGTAGA CCCCAATGTT ATTTTAGATG CAATTGCCAA GGCAACCCCA
GCAGAACGTC AAGCGATTTT AAATGACCCT CAATTAATGG AATATATTCG CATCAGCGAA
GGTACAGCTG CCGATGTCCT TACGGCGGCA TTGCTCGAAG GAGCATTATT CTGGCCAGCT
GGCTCAGGCC CAGCCAATAA TGGGGCGATT CAATCTGATG TGATTAACCC TTTAACCGGA
GAAGAACGAA CTAATGATTT TGCACTTTGG ATGCGTGGCG AAGCAGGGCC ACCTGATCCG
CTTAACGGCA CGATGAATTG CTGGGAAGCA ACCATGTATG CCGCCTATTT AAGTGGCGAA
ATCAGCGAAA GTCAATTACG CGAAATTCAT CAAAACGCGG CTGATGCAGG CGCTGATTAT
TATAATGTGA TTGAAGATGC CTTTGGTGCT GATAATCGCA GCACATGGCA ATCAGGCGAT
CAACCAGCCG CAGGCAGTAT TGTCTTTTTT GAAAGTGACG GTAGCCCACT TGCTCACGTT
GCAATTGCTA CAGGCCGAAC AACCCCTGAT GGCAAGACTG AGATCATGAG TTTATGGGTA
TTGCCTCAAG ATTCAAATGG TAATTTTGTG CGATCAATGC AACGTACCAC CATTGAAGAT
TTACAGGCCT CAATGGCCGA CGCTGGCATT CCATTAGATC AAATTAGCAC TTCACCCAAT
CCATGGGATC AGCCGAATGA TTAG
 
Protein sequence
MGNEIVQIDY QEANQLAQRF SKQQTHVQAI LQSLRQTIQA LENGGWMGDA ATACFKEFHA 
EVVPAYTRLS NVLGEGQSTL QGIATLFRQA EEEAAALFRG EFAAAAGGES GIAQAVGGSG
GNIQADNGGM YQTVQAFATN LSGSGGGQGQ ASELETLIRS LKDQCVDPNV ILDAIAKATP
AERQAILNDP QLMEYIRISE GTAADVLTAA LLEGALFWPA GSGPANNGAI QSDVINPLTG
EERTNDFALW MRGEAGPPDP LNGTMNCWEA TMYAAYLSGE ISESQLREIH QNAADAGADY
YNVIEDAFGA DNRSTWQSGD QPAAGSIVFF ESDGSPLAHV AIATGRTTPD GKTEIMSLWV
LPQDSNGNFV RSMQRTTIED LQASMADAGI PLDQISTSPN PWDQPND