Gene Information |
Locus tag | Tpau_3572 |
Symbol | |
ID | 9157751 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 3681608 |
End bp | 3682657 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | folate-binding protein YgfZ |
Protein accession | YP_003648489 |
Protein GI | 296141246 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
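The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon; the function name `cds_lengths` is hypothetical and not part of the record.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop codon
    that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the Start bp / End bp fields of this record.
gene_len, protein_len = cds_lengths(3681608, 3682657)
print(gene_len, protein_len)  # 1050 349
```

Under those assumptions the result agrees with the Gene Length (1050 bp) and Protein Length (349 aa) fields.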
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0682209 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTCGATT CCGAACATGT CGTGTCCGGT ATAGCGAATC CCGAGGACGC CGTGGATGCG GGTGTGACCT GGCATTACGG CGATCCGTTC GGTGAGCAAC GGGCTGCTGA ACGCGGTGCC GCGGTGGTCG ACCGGTCGAA TCGGCGGGTG ATCACGCTCA CCGGCCCCGA CCGGCTGTCC TGGCTGCACT CGATCACCAG TCAGCACCTC ACCGCCCTCC CCGACGGCGG GTCGGTACAG AACCTCAACC TCGACGGCAG CGGCCGGGTA CTCGACCACT TCTGGGTCAC CGACTCCGAC GGGACCGCCT ACCTCGACAC CGAGCCCGCG ACGCTCGCGC CGAAGGAGGC GCCCCTCTCC CCTGATCTGG GCACGTACCT GCAGCGGATG GTGTTCCGGG CCGACGTGCA GGTGCAGGCG CGCGACGACC TCGCGGTGCT CACCGTGTTC GGCCCCGACG CCGCGACCGT CGCGGAGGCG GTGCCCGGCG TGCGGCGCAG CGAGCCGGAC GAGGTCAACC TGATCATCGA GCGTGCGGCC GTCCCGGACG CGATCACCAC GCTCGTCGCC GCGGGCGCCC GCCCGGTCGG GACGTGGGCG TACGAGGCCC GCCGCGTCGC CGCCGCACAC GCTCGTGCCG GACTCGACAC CGACGACAAG ACCATTCCGC ACGAGGTGAA TTGGATCGGC ACTGCGGTCC ACCTGGACAA GGGCTGTTAC CGCGGGCAGG AGACGATCGC GCGGGTGCAC AACATCGGCC GCCCGCCGCG CCGGCTGGTG CTGCTGCATC TCGACGGTTC CGCCGACGCC ACGCCCGCCA CCGGCGACCC GGTGACGGTG GACGGTCGCA CCGTCGGGCG GCTCGGCACC GTGGTCGAAC ACGCCGACTA CGGCCCGATC GCACTCGCTC TGATCAAACG GACCATCCCG GACGACGCGA AGCTCGAGGC GGGTGGCGCC GCGGCCGCGA TCGACGCGGA CGTCACGCCC GCCGCCGACA CCGCCGAGCA AGCCGGACGG TCCGCCGTCG ACCGGCTGCG CGGCCGGTGA
|
Protein sequence | MVDSEHVVSG IANPEDAVDA GVTWHYGDPF GEQRAAERGA AVVDRSNRRV ITLTGPDRLS WLHSITSQHL TALPDGGSVQ NLNLDGSGRV LDHFWVTDSD GTAYLDTEPA TLAPKEAPLS PDLGTYLQRM VFRADVQVQA RDDLAVLTVF GPDAATVAEA VPGVRRSEPD EVNLIIERAA VPDAITTLVA AGARPVGTWA YEARRVAAAH ARAGLDTDDK TIPHEVNWIG TAVHLDKGCY RGQETIARVH NIGRPPRRLV LLHLDGSADA TPATGDPVTV DGRTVGRLGT VVEHADYGPI ALALIKRTIP DDAKLEAGGA AAAIDADVTP AADTAEQAGR SAVDRLRGR
|
| |
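The gene sequence begins with GTG while the protein begins with M, which is consistent with translation table 11, where GTG can act as a start codon. The sketch below illustrates that relationship on the first 30 bases of the gene sequence. It is an illustration only: the helper `translate_cds` is hypothetical, and the `START_CODONS` set lists only three commonly used bacterial starts, which is an assumption rather than the full table 11 definition.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a subset of the start codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an alternative start codon as methionine."""
    seq = "".join(dna.split()).upper()  # remove the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate_cds("GTGGTCGATT CCGAACATGT CGTGTCCGGT"))  # MVDSEHVVSG
```

The output matches the first block of the protein sequence. Passing the full gene sequence to the same function should reproduce the complete protein under the stated assumptions.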