Gene Xaut_4660 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagXaut_4660 
Symbol 
ID5420766 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameXanthobacter autotrophicus Py2 
KingdomBacteria 
Replicon accessionNC_009720 
Strand
Start bp5150891 
End bp5152513 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content64% 
IMG OID640883924 
Productextracellular solute-binding protein 
Protein accessionYP_001419537 
Protein GI154248579 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.845725 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.101671 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCGCC GCGAATTCGT GAAGTCTGCG TCTGCCACCG CCGTTGCCAC CGGAACCGGT 
GTCGCCGCCC CGGCGGTCTT CTCCTCGGCA CAGGCGCAGG CGCGCAACGA GACGCTGCTG
ATCGTCTCCG AGAGCGGCCC CAACAACCTC GACATCCATG GTGTCGGCAC CAACGTGCCG
GGCTATGAGG CGAGCTGGAA CACCTACGAC CGCCTCATCA CCCACGAGAT GACCGAGAAG
GACGGGGTCC GCTACTACGA CCGCGACAAG CTGAAGGGCG AGCTCGCCGA GGACATGAAC
ATCGGCGACA TGTCGGTGAC CTTCAAGCTG AAGAAGAACG CCACCTTCCA GGACGGCACC
CCGGTCACCG CCAAGGACGT GAAGTGGTCG CTGGACCGCG CCGTCTCCGT GGGCGGCTTC
CCCACCTTCC AGATGAAGGC CGGCTCGCTG GAGAAGCCCG AGCAGTTCGT GGTGGTGGAT
GACCACACGG TGCGCGTGGA CTTCATCCGC AAGGACCGCC TCACCATCCC CGATCTCGCC
GTGATCGTGC CCTGCGTCAT CAATTCCGGG CTGGTGCAGA AGAACGCCAC CGAAAAGGAC
CCCTGGGGCC TCGAATACAC CAAGCAGAAC ACCGCCGGCT CCGGCGCCTA TCGCGTCACC
AAGTGGACCC CCGGCACCGA GGTGATCTTC GAGCGCTTCG AGGACTGGAA GGGCGGCCCG
CTGCCCAAGA TCAAGCGCGT GATCTGGCGC ATGGTGCCCT CCGCCGGCAA CCGCCGGGCG
CTGCTGGAGC GCGGCGACGC CGACATCTCC TACGACCTGC CCAACAAGGA TTTCGTGGAG
CTGAAGCAGG CCGGCAAGCT GAACATCACG TCGGTGCCCT ATTCCAACGG TGTCCAGTAC
ATCGGCATGA ACGTGAAGAA CCCGCCCTTC GACAATCTGA AGGTGCGCCA GGCCATCGCT
TACGCCATCC CCTACCAGAA GATCATGGAC GCCGCCCTGT TCGGCCTCGC CAAGCCCATG
TTCGGCGCCC CGGCGGATGC GCAGACCCAG GTCAAGTGGC CGCAGCCCAC CAAGTTCGTC
ACCGACCTCG CCAAGGCCAA GCAATTGCTG GCGGAGGCGG GCTATCCCGA CGGGCTGGAG
ACGACGCTGT CCTTCGACCT CGGCTTTGCC GGCGTGAACG AGCCGCTGTG CGTGCTGCTG
CAGGAAAACC TGGCGCAGAT CGGCATCAAG ACCACCATCA ACAAGATCCC CGGCGCCAAC
TGGCGCACCG AGCTGACGAA GAAGGTGCTG CCGCTGTTCA CCAACGTGTT CTCGGGTTGG
CTGGACTATC CCGAATACTT CTTCTTCTGG TGCTACCACG GCAACAATTC GATCTTCAAC
ACCATGAGCT ACCAGTCGGC GGCCATGGAT GCCTTCATCG ACGGCGCCCG TGCCGCCGCC
GCCAACGGCG ACAAGGCGGC CTATGATGCG GACGTGAAGG GCATGGTGGA CCTCGCCTTC
GCCGACGTGC CGCGTATCCC GCTCTACCAG CCCTATGTGA ACGTGGCGAT GCAGAAGAAC
ATCACCGGCT ACGAATACTG GTTCCACCGC CGTCTCGACT ATCGCGCTTT CCAGAAGGGG
TGA
 
Protein sequence
MNRREFVKSA SATAVATGTG VAAPAVFSSA QAQARNETLL IVSESGPNNL DIHGVGTNVP 
GYEASWNTYD RLITHEMTEK DGVRYYDRDK LKGELAEDMN IGDMSVTFKL KKNATFQDGT
PVTAKDVKWS LDRAVSVGGF PTFQMKAGSL EKPEQFVVVD DHTVRVDFIR KDRLTIPDLA
VIVPCVINSG LVQKNATEKD PWGLEYTKQN TAGSGAYRVT KWTPGTEVIF ERFEDWKGGP
LPKIKRVIWR MVPSAGNRRA LLERGDADIS YDLPNKDFVE LKQAGKLNIT SVPYSNGVQY
IGMNVKNPPF DNLKVRQAIA YAIPYQKIMD AALFGLAKPM FGAPADAQTQ VKWPQPTKFV
TDLAKAKQLL AEAGYPDGLE TTLSFDLGFA GVNEPLCVLL QENLAQIGIK TTINKIPGAN
WRTELTKKVL PLFTNVFSGW LDYPEYFFFW CYHGNNSIFN TMSYQSAAMD AFIDGARAAA
ANGDKAAYDA DVKGMVDLAF ADVPRIPLYQ PYVNVAMQKN ITGYEYWFHR RLDYRAFQKG