Gene Apar_0079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0079 
Symbol 
ID8412922 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp89282 
End bp90721 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content46% 
IMG OID645021646 
ProductPTS system, lactose/cellobiose family IIC subunit 
Protein accessionYP_003179106 
Protein GI257783889 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1455] Phosphotransferase system cellobiose-specific component IIC 
TIGRFAM ID[TIGR00410] PTS system, lactose/cellobiose family IIC component 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAAAG ATAATGGCCA AGCATCGTTT CTAGATAAAT TTGCAGAAGT TTCTGCAAAA 
GTAGGCAATC AAGTTCACTT GAGAAGCCTA CGCGATGCAT TCGCAACTGT CAGCCCAATT
TACATTTTGG CTGGTATTGC AGTACTGATC AACAACGTTC TGTTCCCACT CATCTTTGCT
AATGATCCAG TCACCTTGGC TAATTTCAAG GTTTGGGGTG CAGCGATTGC TCAGGGAACT
CTGAGTTTCT CGGCAGTTAT TCTTGCAGGC ATCATTGGTT ATTGCCTTGC TCGTAATAAG
CGTTTTGAAA ACGCAATTTC TTGCGTTGTT ATTGGCATTG CTGCTCTTAT TATCATGATG
CCTCAGAGCA TTGTTGCCAC CGCAGGCGCT GTTCTTCACG CAACTAACGC TGCTAACGCA
GCTGCTGCAG CAGTAACACT TCCTGTGGCT GAGGTTGCTA AGCTTCTGCC AAATGACTAT
GCAGTCACTG GCGTCGATGT AACAGGCGCT TTTTCCTCCT CCTTCACTGG TACTAACGGT
CTGTTTGGTG CAATCATTAT TGGCCTGCTT TCGACCACAA TTTTCATTAA GCTCTCTAGC
GTTAAACAGC TGAAGGTCAA CCTTGGTGAG GGTGTTCCAC CAGCGGTTGC AGATTCTTTT
AACACCATGA TTCCTATGAT GTTGACCTTG AGTGTTTTTG GTATTGCTTC TGCTCTTCTT
GCAGTTTGTG CAGGTACTGA CCTTATGACC ATCATTGCAA CAAGCATTTC CGCCCCACTG
AAGGGCCTTA TGAATGCTGG TCCATTTGCT GTCATCGTCA TCTACACCTT TGCAAACCTT
CTCTTCTGCC TTGGTATTCA CCAGTCCACC ATCTCTGGTG TTCTGATTGA GCCAATCCTG
ACCATGCTTA TTGTTGACAA CATGGCAACC TTCGCAGCTG GTCAGCCAAT CCCTCAGGAT
CACTACATGA ACATGCAGAT CATCAACACC TTTGCGCTGA TTGGCGGTTC TGGTTGTACT
CTGATGCTGC TTTTTGACAC CTTTATCTTC TCCAAGAACA AGGCTTCTAA AGACGTTGCG
GCACTTTCTC TTCTCCCAGG TATCTTTAAC ATCAACGAGC CAGTTATTTA TGGTTACCCA
ATTGTCTTTA ACCTTCCTTT GATGATTCCA TTTGTTCTTG TACCAGATTT GTTCATCGGT
CTGACCTACC TGCTCACCAA CCTTGGTTGG ATTAGCCCTT GCGTTGCAAT GGTTCCTTGG
ACCACTCCAG TCTTTTTGAG TGGTTGGTTA GCAACCGGTG GCGACGTTCG TGCTGTTATT
TGGCAGATCG TCGAGGTTCT TCTCGCAATG GCAATTTACC TGCCATTTAT GAAGATCTCC
GAGCGCGCAC AGGTCAAGCA GGCTGAGGCT CTTGCAGAGA AAGCTCAGGA TGCAGAGTAG
 
Protein sequence
MAKDNGQASF LDKFAEVSAK VGNQVHLRSL RDAFATVSPI YILAGIAVLI NNVLFPLIFA 
NDPVTLANFK VWGAAIAQGT LSFSAVILAG IIGYCLARNK RFENAISCVV IGIAALIIMM
PQSIVATAGA VLHATNAANA AAAAVTLPVA EVAKLLPNDY AVTGVDVTGA FSSSFTGTNG
LFGAIIIGLL STTIFIKLSS VKQLKVNLGE GVPPAVADSF NTMIPMMLTL SVFGIASALL
AVCAGTDLMT IIATSISAPL KGLMNAGPFA VIVIYTFANL LFCLGIHQST ISGVLIEPIL
TMLIVDNMAT FAAGQPIPQD HYMNMQIINT FALIGGSGCT LMLLFDTFIF SKNKASKDVA
ALSLLPGIFN INEPVIYGYP IVFNLPLMIP FVLVPDLFIG LTYLLTNLGW ISPCVAMVPW
TTPVFLSGWL ATGGDVRAVI WQIVEVLLAM AIYLPFMKIS ERAQVKQAEA LAEKAQDAE