Gene Apar_0152 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0152 
Symbol 
ID8412998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp176372 
End bp177796 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content53% 
IMG OID645021722 
ProductPTS system Galactitol-specific IIC component 
Protein accessionYP_003179179 
Protein GI257783962 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3775] Phosphotransferase system, galactitol-specific IIC component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0975812 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAAGTAG TAATTTCATT CTTCTCGTTC CTTCAGGGCC TCGGTGTTGC AGTCATGATG 
CCAATCATCC TGACCATCAT CGGTTGTGCT CTGGGAGCAG GCTTTGGTAA GAGCCTTAAG
GCTGGCCTTA TGGTAGGTGT TGGCTTTATC GGCCTTAACC TTGTCATCAA CCAGCTGTTT
GGTACCGCAA TTGGACCTGC AGTTCAGGAA ATGATTCACC GCTTTGGTCT GACTCTTAAC
GTCATCGACG TTGGTTGGCC AGCAGGTGCA GCAATTGCAC TTGGCACCAC CATTGGTCTG
GTTATCATTC CTTTGGCACT CATCGTTAAC CTGCTTCTTG TTTTGGTTAA CTTCACTCAG
ACTGTCAACG TTGATATCTG GAATTACTGG CACTACGCAT TCGTCGGTTC TCTTGTTGCA
AATGTTACCA ACAACATCAT GCTTGGCTTT GCTGCAGCAA TCATCGATGA GGTCATCCTG
CTTGTTCTTG CTGACGCAAC TGCAAAGGAT GTCCAGAAGG CGCTCAACAT GCCAGGCGTT
TCTATCTCTC AGGGCTTCTC GCTTGCTTAT GCTCCAGTTG CTATGGGTCT GAACTGGCTG
ATCGATAAGA TCCCTGGTGT TCGTGACATT GACATCGACG TCGAGTCCAT GCAGAAGCGC
TTTGGCGTCT TCGGCGAGCC ACTGTTCATC GGCACCGTCA TCGGCCTCAT CATCGGCTGC
GTTGCATACT TCGACGCTGC TGATATCGCA GGCTCCATCA CCAAGATCCT GACCATCGGC
GTCACCCTTG GTTCCGTTCT GATTCTTATT CCTCGTATGG CAGCTCTTCT TATGGAAGGC
CTCCTGCCAA TCTCCGAGGC AGCTTCAAAC TTCATCCAGC AGCGCGTTAA GAACCGCGGC
AACATCTGGA TTGGCCTCGA CTCTGCAGTT GCAGTTGGCC ACCCAGTAAC TCTGTCTCTG
GCTCTTATCT GCATTCCTCT GATGGTTCTG TTTGCACTTC TGCTTCAGCC AGTTGGCAAT
CAGACTCTGC CATTTGTTGA CCTTGCCGCT GGCACCTACA TGCTCGCAGT CGCAGTACCT
GTCTGCAAGG GTAACGGCTT CCGCGGCCTG ATCATCGGCA TTGTTTCCGT CATCATCGGC
CTGCTGATCT CCACCGCTCT TGCTCCACTC ATCACACAGT CTGCAACTCA GGCTGGCTTC
GACATCGCAG CTGCAGTTGG TAGCACCGGC GTTGCTGGTT CTTCCCTGAT CACCGTCCTT
TCCGACGGCG GCAGCCCACT GTCCGGCCTG TTTGTTCTGT TCTCCAGCAT CAACCCAATC
GTTGGCGTTG TAGTAACTGG CGCAATCGCA GTTGCACTTG CAGTGTGGAA TCGCAGCCGC
ATCCTCAAGG AGGCTGCTGC AATGCACGAG GCTCAGGAGG CTTAG
 
Protein sequence
MEVVISFFSF LQGLGVAVMM PIILTIIGCA LGAGFGKSLK AGLMVGVGFI GLNLVINQLF 
GTAIGPAVQE MIHRFGLTLN VIDVGWPAGA AIALGTTIGL VIIPLALIVN LLLVLVNFTQ
TVNVDIWNYW HYAFVGSLVA NVTNNIMLGF AAAIIDEVIL LVLADATAKD VQKALNMPGV
SISQGFSLAY APVAMGLNWL IDKIPGVRDI DIDVESMQKR FGVFGEPLFI GTVIGLIIGC
VAYFDAADIA GSITKILTIG VTLGSVLILI PRMAALLMEG LLPISEAASN FIQQRVKNRG
NIWIGLDSAV AVGHPVTLSL ALICIPLMVL FALLLQPVGN QTLPFVDLAA GTYMLAVAVP
VCKGNGFRGL IIGIVSVIIG LLISTALAPL ITQSATQAGF DIAAAVGSTG VAGSSLITVL
SDGGSPLSGL FVLFSSINPI VGVVVTGAIA VALAVWNRSR ILKEAAAMHE AQEA