Gene Haur_3661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3661 
Symbol 
ID5735522 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4602669 
End bp4603886 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content50% 
IMG OID641280810 
Productmonosaccharide-transporting ATPase 
Protein accessionYP_001546425 
Protein GI159900178 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4214] ABC-type xylose transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGCTG AACTAAAAAC CAACGAATCC AGCAAAGCGG CTGGTTTCAA TTTCGCCACC 
GTTGTCGATT TCTTCAAAAA TAATATGCGC GAATATGGCA TGCTGGTTGC CCTTGTGATC
ATTGTGCTGT ATTTTCAATT TCAAACCGAT GGGGTGCTGC TGCAACCACT CAATGTCACC
AATGTGATTT TGCAAAATAG CTATATCGTG GTGATGGCCT TGGGGATGTT GCTGATCATC
GTGGCAGGCC ATATTGACCT CTCGGTCGGC TCGGTGGCAG CCTTCGTTGG AGCGGTGGCT
GGCTATATGC TGGTCGAAAA AGATTATCCA ATCATTGTTG CCTTTGCTGC TTGTATCGCC
ATCGGTGCAG CAATTGGCGC GTTTCAGGGC TATTGGGTGG CCTTTCGCAA AATTCCAGCC
TTTATTGTGA CCCTCGCGGG CATGTTGATT TTTCGTGGCT TGACCTTGGT CATGTTAGGT
GGTACATCAC TTGGCCCCTT CCCCTCAAGC TTCCGCGAAA TGAGCACAGG CTTTATCGGC
GACCCAATCG GCGGAGCTGA ATCAGCTCTC CACATCACGA CACTGATCGT AGGCTTAGCC
TTTGCGGCAA TCATCGTTTA CTTAGATCTA GTCGGTCGCA AAAATAGCCG CTCGTTCAAC
TTCCCAGTCA TGTCAACTCC ATTGTTTATC GCCAAAAATG CCTTGATTGT TGGGATTATT
CTAGCCTTCT GCTATATCCT CGCTTCCTAC GAAGGCTTGC CCAATGTCTT ACTAGTGTTG
ATCGGCTTAA TCGCGCTGTA TATGTTTGTT ACCAAAAAGA CTGTGATTGG GCGACGGATT
TATGCCTTGG GTGGCAACGA AAAAGCCGCT AAGCTCTCTG GTGTGCCAAC CGACCGATTA
ACCTTCTATA CCTTCATCAA CATGGGGGTT TTGGCAGCCA TCGGTGGGTT GATCTTCACC
GCTCGTCTGA ACTCGGCCAC ACCCAAGGCA GGCACAGGCT TTGAGCTTGA TGTAATCGCC
TCAGTCTTTA TCGGTGGTGC ATCAGCCTCA GGTGGCATCG GCACAGTCTT TGGCGTGGTG
ATCGGGGCAC TGGTGATGGG CGTGCTCAAC AACGGTATGT CGATCGCTGG GGTAAGCGTC
GATTGGCAAC AAGTGATCAA AGGCGCTGTG CTGTTGGTCG CAGTCTACTT CGATGTATCC
AGCAAAAATA AAGGTTAA
 
Protein sequence
MSAELKTNES SKAAGFNFAT VVDFFKNNMR EYGMLVALVI IVLYFQFQTD GVLLQPLNVT 
NVILQNSYIV VMALGMLLII VAGHIDLSVG SVAAFVGAVA GYMLVEKDYP IIVAFAACIA
IGAAIGAFQG YWVAFRKIPA FIVTLAGMLI FRGLTLVMLG GTSLGPFPSS FREMSTGFIG
DPIGGAESAL HITTLIVGLA FAAIIVYLDL VGRKNSRSFN FPVMSTPLFI AKNALIVGII
LAFCYILASY EGLPNVLLVL IGLIALYMFV TKKTVIGRRI YALGGNEKAA KLSGVPTDRL
TFYTFINMGV LAAIGGLIFT ARLNSATPKA GTGFELDVIA SVFIGGASAS GGIGTVFGVV
IGALVMGVLN NGMSIAGVSV DWQQVIKGAV LLVAVYFDVS SKNKG