Gene Haur_0298 details


Gene Information

Locus tag: Haur_0298
Symbol:
ID: 5732193
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 355734
End bp: 357194
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 53%
IMG OID: 641277422
Product: extracellular solute-binding protein
Protein accession: YP_001543078
Protein GI: 159896831
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
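
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 355734
end_bp = 357194

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # compare with the Gene Length field
print(protein_length, "aa")   # compare with the Protein Length field
```

Under these assumptions the computed values agree with the Gene Length (1461 bp) and Protein Length (486 aa) fields.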


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAACGCT CACGCCGTCC GTTTATTACT TGGCTAAGCC TCTTAACTCT GATTTCAATG 
ATCTTGGTTG CTTGTGGTGG AGCTGAGACC GCTCCCACTG CAACCACTGC TCCAGCTGCT
ACCGCTACGA CTGCGGCTCA AGTTGAACCA ACTGCTGCTG ATGCTCCAAC CGAAGTTGCC
GCAACCGCTG AACCAGCAAC CGCTGAGCCA GCTGGTGGCG ATGTTGTTAC CTTGAAGCTG
TGGCACATGC CTAACGGTGC TGCTCCAGCC GATGCCATCC AAGCTGAAAT CGATGCCTTC
GAAAAAGCTA ACCCCGGCAT TAACGTTGAG CCAGAAATGC TCGACTGGGG TGCAGCTTTC
CAACGCATCC AAACCTCGGT TCAAGGTGGT GAAGGCCCAT GTATTACCCA ATTGGGTACG
ACCTGGAACC CAACCTTTGC GGCAATGGGT GGCTTACGCC CATTCACCGA AGAAGAAATC
ACCGCTATGG GTGGCAGCGA TAGCTTCGTC GCCGCTTCAT GGGCAACCTC ACAATTGCAA
GGTATGGAAG GTACATTCTC GATTCCATGG TTTGCTGATG TTCGCGCCTT GGCCTATCGC
AAAGACTTGT TGGAAAAAGC TGGCTTGAAG CCAGAAGAAG CCTTCAAGGA TTGGGCCAGC
TTCAAAACGA CCTTGGCTAC CATCCAAGAA CAAAATCCTG ACGTTGCTGG GATCGCTTTC
CCAGGCCGCA ACGACTGGAA CGTTTGGCAA AATAGCTCAA TGTGGATTTG GAACAGCGGT
GGCGATTTGT TGAGCAGCGA TCTCAGCGAA GCAACCTTCA ACTCAGAAGC AGCTGTGGCT
GGGGTTTCAG AATTTGCTAA CCTCTACAGC AGCAAATTGA CCGTTACCAA TACCTTGGAA
TTGAACTCAG CCCAAGTTGA TGCCAGCTTT GGCGATGGCC GCACCTTTAG CGCAATCACT
GGCCCATGGT TGATCAGCAA CGCTCGCACT GCTGCCGACG CTGGTGGCTG GGCTAACCGC
ACCGTGGCCG ATAACTTGGC CTACGCCGAA TTCCCAGCTG GCCCTGGTGG CTCATACACC
TTCGTTGGTG GTAGCAACTT GGCCATCTTG AAGAGCTGCG AAAACGCCGA TGCCGCTGTC
AAGTTCGTGC AATTCTTGGC TGCTAACGAA TCACAATTGC GCTATAGCCA AGCCATCGGG
ATGTTGCCAG CAACCAAGAC CGCTCAAGCC GATGCTAGCA TCGCCAGCGA TGCCTTGTAC
AGCGTCTTCA TCGCCGCTGC TGCTAAGGGC AAAACCTCAG CTCCAATCGC TGAATGGGGT
CAAGTCGAGA GCGTTCTCAA CGAACAACTT GGTTCACTCT GGGACGATGT AGCAACCGCT
GGCGGTCCAG TTAGCGCTGA AGTCGTCAAG ACCCGCTTGG ATCAAGCTGC TCAAACTGTC
AACGAATTGT TGGGTAACTA A
 
Protein sequence
MQRSRRPFIT WLSLLTLISM ILVACGGAET APTATTAPAA TATTAAQVEP TAADAPTEVA 
ATAEPATAEP AGGDVVTLKL WHMPNGAAPA DAIQAEIDAF EKANPGINVE PEMLDWGAAF
QRIQTSVQGG EGPCITQLGT TWNPTFAAMG GLRPFTEEEI TAMGGSDSFV AASWATSQLQ
GMEGTFSIPW FADVRALAYR KDLLEKAGLK PEEAFKDWAS FKTTLATIQE QNPDVAGIAF
PGRNDWNVWQ NSSMWIWNSG GDLLSSDLSE ATFNSEAAVA GVSEFANLYS SKLTVTNTLE
LNSAQVDASF GDGRTFSAIT GPWLISNART AADAGGWANR TVADNLAYAE FPAGPGGSYT
FVGGSNLAIL KSCENADAAV KFVQFLAANE SQLRYSQAIG MLPATKTAQA DASIASDALY
SVFIAAAAKG KTSAPIAEWG QVESVLNEQL GSLWDDVATA GGPVSAEVVK TRLDQAAQTV
NELLGN
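
The sequence blocks above are printed in groups of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that might be done, together with a GC fraction calculation and a codon-by-codon translation. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard genetic code, used here on the assumption that translation table 11 assigns the same amino acids to non-start codons as the standard code does.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used to group the printed sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed above.
first_bases = clean("ATGCAACGCT CACGCCGTCC GTTTATTACT")
print(translate(first_bases))

# Last 21 bases of the gene sequence, as printed above.
last_bases = clean("AACGAATTGT TGGGTAACTA A")
print(translate(last_bases))
```

The two printed fragments can be compared with the start and end of the protein sequence. Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.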