Gene Haur_3660 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3660 
Symbol 
ID5735521 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4601134 
End bp4602672 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content49% 
IMG OID641280809 
ProductABC transporter related 
Protein accessionYP_001546424 
Protein GI159900177 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGATG TACTACTTGA AATGCGCGGA ATCACCAAAA CCTTTCCTGG GGTCAAAGCG 
CTGAATGATG TCAACCTACA AGTGCGCCGA GCCGAAATTC ATGCCTTGGT CGGCGAAAAT
GGCGCAGGCA AATCAACCCT GATGAAGGTG CTCAGCGGAG TCTATCCCCA CGGAACCTAC
ACTGGCGAGA TTATTTTCGA GGGCCAAGAA TGTCGTTTTA ACGATATTAG CGCCAGCGAA
AAACTGGGGA TCGTGATTAT TCACCAAGAA CTGGCCTTGA TTCCGTTTCT CTCCATTGCC
GAAAATATCT TTCTCGGCCA CGAAACTGCC TCACGCGGCG TGATCAATTG GAATACGGCG
ATTGCCAAAA CCCAAGCATT GCTCAAAAAA GTGGGGCTGA GTGAATCGCC CAATACCTTA
ATTACTGATA TGGGCGTGGG CAAGCAACAA CTGGTGGAAA TTGCCAAAGC GCTTGCTAAA
GAAGTTAAAC TGCTGATTCT CGATGAACCA ACTGCTAGCT TAAACGAAAG CGATAGCGAA
GCCTTGCTCG ACTTGCTGCT GCAATTCAAA CAACAAGGCA TTTCATCAAT TCTCATTTCG
CACAAACTGA ATGAGGTTTC GCAGGTTGCT GATGCGATTA CAGTGCTCCG TGATGGCGCA
ACCATCGAAA CGCTCGATTG CCACAAAGAA CCAATTAGCG AAGACCGAAT TATCCGCGGC
ATGGTTGGCC GCGATTTGAG CCATCGTTTC CCGCCGCGCG AGGCCACGAT TGGCGAAACC
ATGTTTGAAG TCCGCGATTG GAATGTCTAT CACCCACTCC ACGCCGAGCG CAAAGTTGTC
GATAACGTCA GCTTTACTGT GCGCAAAGGC GAAATCGTCG GGATCGCTGG CCTGATGGGT
TCGGGCCGCA CCGAATTAGC AATGAGTATT TTTGGCCACG CTTATGGTAA GCATATCAGT
GGCCATGTAT ATAAAAATGG CCGTGAGATC GATGTTCGCA CGATCCAAAA GGCCATTCGC
AATGGCATTG CCTATGTCAC CGAAGATCGC AAAAGCTATG GTTTGGTGTT GATCGATGGG
ATTAAAAATA ATATTACCCT CGCCAATCTA CCAGCCGTCG CCCGCAACTC GGTGATCAAC
GAGCCAAAAG AGTTGCGCGT CACCAATCAA TATCGTGATG ATTTAGCAAT TCGTAGCTCC
AGCGTCTTGC AAAAAACCGT TAATCTCTCT GGTGGCAATC AGCAAAAAGT CGTGTTAAGC
AAATGGCTGT TTGCCAGCCC CGATATCCTG ATTCTCGATG AGCCAACCCG TGGGATTGAT
GTGGGAGCCA AATACGAAAT CTACACGATT ATCAATCGCT TGGCGCAAGA AGGCAAAGCC
GTCATTGTGA TCTCTTCGGA ATTACCCGAA ATTCTCGGCA TGTGTGATCG CATTTATGTG
ATGAATGAAG GCCAGATTGT GGGCGAAATG CCTGCTGCTG ATGCCTCGCA AGAGAAAATC
ATGAAATGCA TTATGCAGCA GGAGGTTAAC CAGCCATGA
 
Protein sequence
MSDVLLEMRG ITKTFPGVKA LNDVNLQVRR AEIHALVGEN GAGKSTLMKV LSGVYPHGTY 
TGEIIFEGQE CRFNDISASE KLGIVIIHQE LALIPFLSIA ENIFLGHETA SRGVINWNTA
IAKTQALLKK VGLSESPNTL ITDMGVGKQQ LVEIAKALAK EVKLLILDEP TASLNESDSE
ALLDLLLQFK QQGISSILIS HKLNEVSQVA DAITVLRDGA TIETLDCHKE PISEDRIIRG
MVGRDLSHRF PPREATIGET MFEVRDWNVY HPLHAERKVV DNVSFTVRKG EIVGIAGLMG
SGRTELAMSI FGHAYGKHIS GHVYKNGREI DVRTIQKAIR NGIAYVTEDR KSYGLVLIDG
IKNNITLANL PAVARNSVIN EPKELRVTNQ YRDDLAIRSS SVLQKTVNLS GGNQQKVVLS
KWLFASPDIL ILDEPTRGID VGAKYEIYTI INRLAQEGKA VIVISSELPE ILGMCDRIYV
MNEGQIVGEM PAADASQEKI MKCIMQQEVN QP