Gene Xaut_4036 details


Gene Information

Locus tag: Xaut_4036
Symbol:
ID: 5424401
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Xanthobacter autotrophicus Py2
Kingdom: Bacteria
Replicon accession: NC_009720
Strand:
Start bp: 4464890
End bp: 4465885
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 67%
IMG OID: 640883290
Product: ABC sulfate transport system, periplasmic binding protein
Protein accession: YP_001418915
Protein GI: 154247957
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
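
The coordinate and length fields in this record are related by simple arithmetic. The Python sketch below shows one way to check them. It assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4464890, 4465885)
print(length_bp, protein_length(length_bp))  # prints: 996 331
```

Both values agree with the Gene Length and Protein Length fields of this record.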


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.7213
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.0122121
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGGAT TGCTCAATCG GCGCACGCTG CTCAGCGCGC TCGCAGGCTC CGCCGGGGCC 
CTCGCCCTGC CGCATCTGCC GGCGCTGGCG GCCCCGGCGC AGGGGCTCGA AATCCTCGGT
GCGCCCAATG GCTCCACCAT CGTGCTGCTG CGCCTGCTGC AGTCCGGCGC GCTCGACCAG
GTGGCGCCCG GCGCGTCCTT CCGCCTGTGG CGCGACACGG ACGAGTTGCG CGCCGCCATC
GTCTCCGGCC GCACCAGCCT GTTCACCACC CCCACCCATG TGCCGGCGAA CCTCGCCAAT
CGCGGCTTGC CGCTGAAGCT GTTCGCGATC CTGTCCATGG GCCATCTGTT CGTGGTGTCG
GGGGACGAAG GCATCAAGTC GTTCAAGGAC CTTGCCGGCA AGGAGCTGGT CGGCTTCTTC
AAGAACGACA TGCCCGACCT CGTCTTCCGT TCCATCGCCA AGGGCTACGG CATGGATCCG
GACAAGGACA TGAGCATCAC CTATGTGCGC ACCCCCATGG AGGCGGCGCA GATGCTGGCC
GCCGGGCGCG CCACCACCGC CATCCTTTCC GAGCCGCCGG CTACCGCAGC CATCCTGATG
GCGAAGAAGG AGGGCCGCAT CCTCAACCGC GCCATCAGCC TGCAGGACGA CTGGAAGGTG
CAGCACAAGG GCCTCGGCCT GCCCATGGCC GGCATCGCCG TGCACGAGCG CCTGATCGAG
CACAGCCCCG AGCTGATCGC GGCGCTCGGT GCGGGCCTGC CCGGAGCCCG CGACTGGGTG
ATGGCCAACA AGAGTGAAGC AGGCCAGCTC GCCGAGCAGA AGATGGACGT GAAGGCCCAC
ATGTTCGCCA ACGCCCTCGA CCACTTCAAC GTGGTGGCGG AACCGGCGGC GAAACAGAAG
GCCGGCCTCA TCGCCTTCTA CGAGACCCTT TTGGCCTTCG AGCCGGATGC ATTGGCCGGC
AAGCTGCCGC CCGACAGCTT CTACATGAAC TTCTGA
 
Protein sequence
MNGLLNRRTL LSALAGSAGA LALPHLPALA APAQGLEILG APNGSTIVLL RLLQSGALDQ 
VAPGASFRLW RDTDELRAAI VSGRTSLFTT PTHVPANLAN RGLPLKLFAI LSMGHLFVVS
GDEGIKSFKD LAGKELVGFF KNDMPDLVFR SIAKGYGMDP DKDMSITYVR TPMEAAQMLA
AGRATTAILS EPPATAAILM AKKEGRILNR AISLQDDWKV QHKGLGLPMA GIAVHERLIE
HSPELIAALG AGLPGARDWV MANKSEAGQL AEQKMDVKAH MFANALDHFN VVAEPAAKQK
AGLIAFYETL LAFEPDALAG KLPPDSFYMN F
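
As a consistency check, the gene sequence can be translated and compared with the protein sequence, and its GC content recomputed. The Python sketch below is a minimal illustration, not the database's own method. The helper names are hypothetical, and it relies on translation table 11 using the same codon-to-amino-acid assignments as the standard table (the two differ in permitted start codons, which this sketch ignores). It also assumes the input contains only A, C, G and T.

```python
from itertools import product

# Codon table in the conventional TCAG ordering used for genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGAACGGAT TGCTCAATCG GCGCACGCTG"
print(translate(first_blocks))  # prints: MNGLLNRRTL
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` extends the same check to the whole record.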