Gene Information |
Locus tag | Xaut_4035 |
Symbol | |
ID | 5424400 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Xanthobacter autotrophicus Py2 |
Kingdom | Bacteria |
Replicon accession | NC_009720 |
Strand | - |
Start bp | 4463703 |
End bp | 4464593 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640883289 |
Product | extracellular solute-binding protein |
Protein accession | YP_001418914 |
Protein GI | 154247956 |
COG category | [E] Amino acid transport and metabolism; [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
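The coordinates and lengths listed in this record agree with each other, which simple arithmetic confirms. The sketch below is a minimal check, not part of the original record. It assumes that the start and end positions are inclusive, 1-based coordinates and that the gene length includes the stop codon; the record states neither, and the variable names are illustrative only.

```python
# Values copied from the Gene Information rows of this record.
start_bp = 4463703
end_bp = 4464593
gene_length_bp = 891
protein_length_aa = 296

# Assumption: start and end are inclusive, 1-based positions on the replicon.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length counts the terminal stop codon,
# so the codon count is one more than the protein length.
codons, remainder = divmod(gene_length_bp, 3)

print(span_bp)      # 891
print(codons - 1)   # 296
print(remainder)    # 0
```

Under those assumptions, the span equals the stated gene length, and 297 codons minus one stop codon gives the stated 296 aa.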
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.876944 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0228737 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACAG GAACCGGTCA ATCAGCGCGT GGGAGTGCCC CCTTGCGCGG CCTGTGGGCG CCCCTTGCGG CGCTGGCGAT CTTCGCGGCC TGTGCGGCCC CCGCCGGTGC CCAGACCCTC GACCGGGTGG CGGGCGGAGA AGCCTTTCGC ATCGGCTATC GCCAGTTCGC CCCGCCCTAT TCCTACGCCG CCGCCAACGG CCAGCCCTCG GGCTATATCG TGGACCTGTG CCGCGAGGTG GCGGACGGGG TGAAGCGGAC GCTGAAGCTG CCGGCCATCA AGGTGGATTA TGTGAAGGTC ACCGCCGAGG ACCGCTTCGA GGCGGTGCGC GACGGGCGCA TCGACATCCT GTGCGAGCCG TCCTCCATGA CCATGTCGCG GCGAGCGCTG GTGGACTTCT CCCTGCCGAC CTTCCTTGAC GGGGCGGGCG TCGTCACCCG TGGCGCGCCG GTGAAGGGGC TGGAGGACCT CAAGGGCAAG AAGGTGGGCG TGCTGCGCGG CACCACCACC GAGGAGACCC TGCGCTCCAC CCTGGGCCAG ATGCGCATCG CCGCCGACAT CGTCACCGTC ACCGACCATC CCGACGGGCT CAAGCAGCTC GCCGATGGCA AGCTCGACGC CTATTTCGGC GATCGCGGCA TCCTCAACTA CCTGATCGCC AACAGCCCGG CCGGCAACCG CCTCAGCCTC TCCGACCAGT ATTTCACCTT CGAGACCTAT GCCCTCGCCC TGCCCCGCGG CGATCAAGCG TTCCGTCTGG TGGTGGACGC GACGCTCGCG GACCTCTACC GCACCGAGCG CATCCGCGAC ATCTATGCCA AGAGCTTCGG CAAGTTCCCG CCGGACCAGT TCCTCAACGC CCTCTTCGTC ATCAACGGCG TGCCGAAATA G
|
Protein sequence | MATGTGQSAR GSAPLRGLWA PLAALAIFAA CAAPAGAQTL DRVAGGEAFR IGYRQFAPPY SYAAANGQPS GYIVDLCREV ADGVKRTLKL PAIKVDYVKV TAEDRFEAVR DGRIDILCEP SSMTMSRRAL VDFSLPTFLD GAGVVTRGAP VKGLEDLKGK KVGVLRGTTT EETLRSTLGQ MRIAADIVTV TDHPDGLKQL ADGKLDAYFG DRGILNYLIA NSPAGNRLSL SDQYFTFETY ALALPRGDQA FRLVVDATLA DLYRTERIRD IYAKSFGKFP PDQFLNALFV INGVPK
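The sequences in this record are printed in space-separated groups of ten characters, so they need cleaning before any computation. The sketch below is a hedged illustration and not the database's own tooling. It strips the grouping, computes a GC fraction, and translates codon by codon. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table uses the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which codons can act as start codons). This gene starts with ATG, so no special start-codon handling is included.

```python
# Codon table laid out in TCAG order. The amino-acid assignments are those of
# the standard code, which translation table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the whitespace that splits the sequence into 10-character groups."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
head = clean("ATGGCGACAG GAACCGGTCA ATCAGCGCGT")
print(translate(head))               # MATGTGQSAR

# Last two codons of the gene sequence (AAA, then the TAG stop).
print(translate(clean("AAA TAG")))   # K
```

Passing the full gene sequence through `clean` and then `gc_fraction` or `translate` allows a comparison with the GC content and protein sequence rows of this record.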