Gene Haur_1180 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1180 
Symbol 
ID5733073 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1354081 
End bp1355538 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content53% 
IMG OID641278320 
Productextracellular solute-binding protein 
Protein accessionYP_001543956 
Protein GI159897709 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACATC GCAGTTTCCT GTTGTTACTG CTGATCAGCA TGCTGATCGC CGCCTGTGGT 
GGTAGCACAA CGCCTACGAC CGCACCAACC GCTGACACTG CTGCTCAAGC CACCACGGCT
CCAGCCGCAA CCGAAGCACC AGCCGCGACC GAAGCACCAG CCGCAACCGA AGCACCAGCC
GCAACCAAAA CTACAGATAG CACTGGCAGC GTCCCAGCCG TCAACCCTAA CTATGCCGAA
TTAGTCCGCG CCGAAGCTGG CGAATTCAAA GGCACTAAAG TCAGCATTTT CGGTTTGTGG
GTTGAAGCTG AAGACACCGC CTTCAATCAA ACCTTGGCTG CTTTCGAAGC CCGCACTGGC
ATCGATGTTC AGTACGAAGG CTCGAAAGAT TTCGAAACTT TGATTCCTGT TCGCGCCGAA
GGCGGTAACG CCCCTGACAT CGCTGGTTTC TCACAACCAG GCTTGATGGC AACCTTGGCT
CGCAAGAACA AGTTGGTCGA TATCTCAAGC TTCATCAAAC CAGAAGACTT GCAAAAAGAT
TACATCCAAT CATGGATCGA TCTTGGCTCA GTTGATAACA CCCTCTACGG TGTCTTCTTC
CGCGCCAGCA CCAAGAGCAT CGTCTGGTAT CCAGTTCCAG CCTTCGAAGA AGCTGGCTAC
ACCATTCCAG AAACCTGGGA TGAATTGACC GCTTTGAGCG ACAAGATGGT TGCTGATGGC
AACACTCCAT GGTGTATCGG CTTGGAACAC GGCAACGTGA CCGGTTGGGT GGCTACCGAC
TGGATGGAAG ACATCATGTT GCGCACCGCT GGCGCTGAAA CCTACGACAA GTGGGTCAGC
CACGAAATCC CATTCACCGA CCCAAGCGTC AAGAACGCCG CTGAAGTCAT GGGTAAAATC
TGGTTCAACG AAACCTACGT TGCCGGTGGC CGCGAAGGTA TCTTGACCAC CACCGTGGCC
GATACCCAAA CCCCAATGTT CAACGCCGAC AAACCAGGCT GCTGGTTGCA CCGCCAAGCT
GGCTGGATTC CATCATTCTT CCCAGAAGGC AAGAAAGCTG GTACCGACAG CAACTTCTTC
TACTTGCCAC CAATTGATCC AGCTCAAGGC AAGCCTGTGT TGGGTGGTGG CGACGTGTTC
GCAATGTTCA ACGACCGCCC TGAAGTTCGC GCCGTGTTGC AATACTTGGC TAGCCCAGAA
TCAGCCAAAG GCTGGGTCGA AGCTGGTGGC TTTATCTCAC CAAACAAGAG TGTAGATTTG
GCTTGGTATG GTAACGACAT CGACCGCCGC CAAGCTGAAA TCATCAAGGA AGCTACCGTA
TTCCGCTTTG ATGCTTCTGA CTTGATGCCA GCCGAAGTTG GCGTAGGTAC CTTCTGGAAA
GGTATGGTCG ATTACGTGAA CAATACCGAA ATCGACACCG TGTTGCAAAC CATCGAAGAA
AGCTGGCCAC AAAACTAA
 
Protein sequence
MKHRSFLLLL LISMLIAACG GSTTPTTAPT ADTAAQATTA PAATEAPAAT EAPAATEAPA 
ATKTTDSTGS VPAVNPNYAE LVRAEAGEFK GTKVSIFGLW VEAEDTAFNQ TLAAFEARTG
IDVQYEGSKD FETLIPVRAE GGNAPDIAGF SQPGLMATLA RKNKLVDISS FIKPEDLQKD
YIQSWIDLGS VDNTLYGVFF RASTKSIVWY PVPAFEEAGY TIPETWDELT ALSDKMVADG
NTPWCIGLEH GNVTGWVATD WMEDIMLRTA GAETYDKWVS HEIPFTDPSV KNAAEVMGKI
WFNETYVAGG REGILTTTVA DTQTPMFNAD KPGCWLHRQA GWIPSFFPEG KKAGTDSNFF
YLPPIDPAQG KPVLGGGDVF AMFNDRPEVR AVLQYLASPE SAKGWVEAGG FISPNKSVDL
AWYGNDIDRR QAEIIKEATV FRFDASDLMP AEVGVGTFWK GMVDYVNNTE IDTVLQTIEE
SWPQN