Gene Haur_2204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2204 
Symbol 
ID5734091 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2800811 
End bp2801818 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content52% 
IMG OID641279345 
Productmonosaccharide-transporting ATPase 
Protein accessionYP_001544972 
Protein GI159898725 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00482964 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAAGC CAACCAACAA GCCGCAAGTT CTGGTAAACT CCGTCGGATA TTCCCGCCGT 
GATTTTCTGC GTTTCGGGGC GTTTACCACT GTCGGGCTAC TCGCTGGCTG TGGAACCACC
AGCGCCACGC CGGTAAGCAC CGCCACCCCA ACTAAGCCCA AAATTGGCTT GGTCATGAAA
TCGCTGAGCA ACGAGTTTTT CCAACAAATG CAGGCTGGCG CTGAAGAATA TGCCCGCCAA
AATGCAACCC GCTTTGATTT CACCGCCACC GGCATTAACG ATGAGCGCGA TTTCGCCACC
CAAATTGCTT CATTCGAGCG CTTAGTTAAT GAGCAATATG ATGTGATTGT GTTGGCTCCC
GCCGATTCGA TTGCCTTGGT TGCCCCGGTT GCCAAAGCGG TCAAAGCTGG CATCGTCGTG
ATCAATATCG ACGTTGCGCT TGATGAAGCA ACCAAAAAAG CTGCTGGCAT CGATCTGGCC
TTTTTTGGCC CCGATAATCG TGCCGGAGCC AAAATGTCGG GCGAGGTGCT GGCCAAAGCC
TTAGGCGCAG GCGGCAAAGT CGCAGTGCTC GAAGGCAATC CAGAGGCCGA TAATGCCGTC
CAACGCCGCT TAGGCTTCGA CGATGCCATC GCCGATGGCA GCTTAAATTT AGTTGTTGCT
GAAAGTGGCC ACTGGGAAAC CAGCGAAGGC CAAAGCATTA CCGCCGCATG GCTCAAAAAA
TATCCCGACT TACAAGGCAT CATGTGCGCC AACGATTCAA TGGCTTTTGG GGCAGTCCAA
GCACTCGAAG CCGCCAATCT GCTCGATAAA ATCAAAGTCG TGGGCTTTGA TAACATTCCT
GCTGTGCAAC CCTTGATCAA AGATGGTAAA ATGCTGGCGA CCGTTGAACA ATACGGTGCG
CAAATGGCAG CAATCGGTAT GGATTATGGC TATCGCACGC TCAAAGGCGA GAAATTTAGC
GGCTGGATTC GCACCGAGCT AAAATTAATC ACTAAAGATA ATCTCTAG
 
Protein sequence
MEKPTNKPQV LVNSVGYSRR DFLRFGAFTT VGLLAGCGTT SATPVSTATP TKPKIGLVMK 
SLSNEFFQQM QAGAEEYARQ NATRFDFTAT GINDERDFAT QIASFERLVN EQYDVIVLAP
ADSIALVAPV AKAVKAGIVV INIDVALDEA TKKAAGIDLA FFGPDNRAGA KMSGEVLAKA
LGAGGKVAVL EGNPEADNAV QRRLGFDDAI ADGSLNLVVA ESGHWETSEG QSITAAWLKK
YPDLQGIMCA NDSMAFGAVQ ALEAANLLDK IKVVGFDNIP AVQPLIKDGK MLATVEQYGA
QMAAIGMDYG YRTLKGEKFS GWIRTELKLI TKDNL