Gene Apar_1198 details


Gene Information

Locus tag: Apar_1198
Symbol:
ID: 8414076
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 1343908
End bp: 1345395
Gene Length: 1488 bp
Protein Length: 495 aa
Translation table: 11
GC content: 50%
IMG OID: 645022792
Product: phosphotransferase system EIIC
Protein accession: YP_003180217
Protein GI: 257785000
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific
TIGRFAM ID: [TIGR00826] PTS system, glucose-like IIB component
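
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the record.

```python
# Values copied from the record above.
START_BP = 1343908
END_BP = 1345395


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # the stop codon encodes no residue


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1488 495
```

Under those assumptions the result agrees with the reported gene length of 1488 bp and protein length of 495 aa.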


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.742148
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTACA CCAAGCTTTC AGAACAGATT CTTGCTGCCG TCGGTGGTAA AGAAAATGTT 
CAGAGCAACA TGGTCTGCAT GACCAGGCTC CGCGTTAAGA CCGTAGATCC TTCAAAGGTT
GATTCTGAGG CCATCAAGGC TATTGATGGC GTCATGGGTC TGGTCGAGGA TGCAGAGTAC
CTTGAGGTTG TTCTTGGTCC TGGCGTTGTC AACAAGGTTA TTGTTGAGTT CTCCAAGCTT
ACCGGTGTTG CTGCTGGCGA CGCATCTGAG GATGATGTTG TTTCTGCAGC TAAGGACAAC
AAGGCAGCTC AGAAGGCTAA GTATGAGAAC AAGCCTGTTC AGCGTTTCCT AAAGAAGATT
GCTAACATCT TTGTTCCACT GCTTCCTGGC ATCATTTCTG CAGGTTTGAT TAACGGTATC
ATCAACGTTA TCAACTTCTC CACTGGTAAG GCTTATGCCA ACGAGTGGTG GTTTGCAGCT
ATCTGGACTA TGGGTTGGGC GCTCTTTGCT TATCTGCCAA TTCTCGCTGG CGAAAACGCT
GCCAAGGAGT TTGGTGGTTC TCGCGTTCTT GGTGCTATGG CTGGTGCACT TTCCATTGCT
AACGCTGGTA TGCCTCTTCT TGCTTCCAAG ACTGTTGACG GCGTTGCAAC TCGTTTGGTT
CACCTTCCAT TTAGCCTCCC AACTGTTGCT TTCAAGGAGG GCGCATTTAT CGTTGCTTCC
TCTGACCTGT TCAACGCAGC AGCAGGCGGC ATGATTGGTG CAATCATCTG CGCAATCTTC
TTTGCATTCC TGGAGAAGAA CCTGCACAAG GTTATGCCAA GCGTTCTTGA CACCTTCCTG
ACCCCACTTT GCACCGTCAT CATCGGTGTC ATTGGTTCTG TCCTGATTCT TCAGCCTGCA
GGTGCATGGC TCACTCAGGC AATCTTCTTT GTTCTTCAGT TCTTCTATGA CAAGCTTGGC
GTCTTTGGTG CTTACCTGCT TGGTTCTACC TTCTTGCCAC TGGTTTCTGT TGGTCTTCAC
CAGGCACTTA CACCAATCCA CGTTATGCTT AACAACCCAG AAGGCCCAAC TCAGGGTATC
AACTACCTGC TTCCTCTGCT GATGATGGCA GGCGGTGGCC AGGTTGGTGC TGGTCTTGCT
ATTCTCTTCA AGACTAAGAA CAAGCGCGTT AAGAAGTACC TCACCGAGTC CATCCCAGTT
GGTATGCTCG GCATCGGCGA GCCTCTTATG TACGCAGTTA CTCTGCCTCT TGGTAAGCCA
TTCATTACCG CATGCCTTGG CTCTGGTGTT GGCTCCGTAA TTGCTTACCT CTTCCACCTT
GGCACCGTTT CCCAGGGCGT CTCTGGCCTC TTTGGTCTGC TGATTGTTCA GCCTGGTAAC
CAGGTCTTCT ACCTGCTTGC AATGCTTCTT GCTTATGCTG CAGGCTTTGC ACTTACTTGG
TTCTTTGGCG TTGACGAGGA TCGTATCAAC GACGTCTTTG GCGAGTAA
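
The GC content field can be recomputed from the gene sequence as printed. This is a minimal sketch, assuming the sequence is pasted in as a string with its spaces and line breaks intact; `gc_fraction` is a hypothetical helper name.

```python
def gc_fraction(seq):
    """Fraction of G and C bases in a DNA string; whitespace is ignored."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First block of 10 bases from the gene sequence, used only as a small demo.
print(gc_fraction("ATGGATTACA"))  # prints: 0.3
```

Applied to the full 1488 bp sequence, the rounded percentage can be compared with the GC content reported in the record.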
 
Protein sequence
MDYTKLSEQI LAAVGGKENV QSNMVCMTRL RVKTVDPSKV DSEAIKAIDG VMGLVEDAEY 
LEVVLGPGVV NKVIVEFSKL TGVAAGDASE DDVVSAAKDN KAAQKAKYEN KPVQRFLKKI
ANIFVPLLPG IISAGLINGI INVINFSTGK AYANEWWFAA IWTMGWALFA YLPILAGENA
AKEFGGSRVL GAMAGALSIA NAGMPLLASK TVDGVATRLV HLPFSLPTVA FKEGAFIVAS
SDLFNAAAGG MIGAIICAIF FAFLEKNLHK VMPSVLDTFL TPLCTVIIGV IGSVLILQPA
GAWLTQAIFF VLQFFYDKLG VFGAYLLGST FLPLVSVGLH QALTPIHVML NNPEGPTQGI
NYLLPLLMMA GGGQVGAGLA ILFKTKNKRV KKYLTESIPV GMLGIGEPLM YAVTLPLGKP
FITACLGSGV GSVIAYLFHL GTVSQGVSGL FGLLIVQPGN QVFYLLAMLL AYAAGFALTW
FFGVDEDRIN DVFGE
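
The protein sequence can be checked against the gene sequence by translating it. The sketch below assumes the codon assignments of NCBI translation table 11, which match the standard genetic code for amino acids and differ only in the permitted start codons; it builds the codon table from the NCBI-ordered amino acid string and does not depend on any external library. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# NCBI ordering: bases T, C, A, G with the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces and line breaks
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGGATTACA CCAAGCTTTC AGAACAGATT"))  # prints: MDYTKLSEQI
```

The first ten residues match the start of the protein sequence in the record; passing the full gene sequence to `translate` allows the same comparison over all 495 residues.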