Gene Apar_0821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0821 
Symbol 
ID8413686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp904154 
End bp905809 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content45% 
IMG OID645022403 
ProductMonosaccharide-transporting ATPase 
Protein accessionYP_003179841 
Protein GI257784624 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4211] ABC-type glucose/galactose transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00463034 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCAAAAA CAGGTGGATC TGTTCTGACG GCTGAGCAGG AAAAAGAGCT GCTAAAGCCT 
ATTGATCAGA AGATTGGTTC CATTCAGGCT CAAATTGATG AGCTTCGCGC AAATGGTACC
AATAAAGTAA TTTCAACGTT GAGCGCTATT GAGTCTACAA AGCGCGATAA GTCCATTTCT
GCCGAAGAGC GTACCACGCT TATTGAGGGC TACAAAACTG AACTTGAGCT TGCAAAGAAG
GTTGAGTCCG AGAACAATGC TCAGGTCTCC AAGCTTATTG CAGAGGCAGA GGCTTACCTC
AAGGAACACT ACAAGAGCGA GTATCTAGAG CCCGTTAAGG CTAGCTGTGC TGTCGAGAAG
ACTCAGGCCA AGCAGAATTA TGAGGCTGCT CTTGATCGTC TTAAAAAGGA GCACGAAGAA
GCTGTCAGGA AGACTTCTGA CGCTCAGGAA ATCAAGGACG AGAAGTACGT TTACAAGAAC
CGTCAGTTTG ACGCTAAGGT CAACTATCAG AAAGATCTTC AGCGCATCAA AGATCGTGCT
CATAATGCAT TTAGTCATGA GTATCACCTC ATCGACCTTC TCAGGATGTC TAAGTTTACT
CCACTGGAGT CTCAGGCTCA GAAGTGGGAG AACTACAAGT ACACCTTCAA CACCCGTTCA
TTCCTGCTTC AGAACGGTCT GTACATTGTT ATTCTCCTGG TATTTATCGC CCTTTGTATT
ATTACTCCAG CAGTCAAGGG CACTCAGCTT TTGACATACT CCAATGTCAT TAACATTCTT
CAGCAGGCAT CTCCTCGAAT GTTCTTGGCT CTTGGTGTAG CTGGATTGAT TTTGCTTACC
GGAACTGACC TTTCTATTGG TCGTATGGTT GGTATGGGTA TGACCGCTTC AACCATTATC
ATGCATCAGG GTATCAATAC TGGTCAGGTC TTTGGTATCA CTTTTGATCT TACTGGTGTT
CCAATTCCTG TCAGAATCAT TATGGCTTTG GTCACTTGTA TTGTTCTGTG TACTTGCTTC
ACTAGTATTG CTGGTTTCTT TACCGCCAAG TTTAAGATGC ACCCATTCAT CTCGACCATG
GCTAATATGC TGATTATCTT TGGTATTGTT ACCTATGCAA CAAAGGGTGT TTCGTTTGGT
GCCATCGAGC CATCTATTCC AGATATGGTC ATTCCACGAA TTGGTAAATT CCCATCTATC
ATTTTGTGGG CAATTGCCGC TATCGCTATT GTTTGGTTCA TTTGGAACAA GACTACCTTT
GGTAAGAACC TCTACGCTGT AGGTGGAAAC CCTGAGGCAG CAGCTGTTTC TGGTATTTCA
GTCTTTAGAG TTATGGTTGG CGCTTTTGTC ATGGCTGGTA TTCTTTATGG ATTTGGTTCA
TGGCTCGAGT GCATGCGTAT GGTTGGCTCT GGTTCAGCAG CTTATGGTCA GGGCTGGGAT
ATGGACGCAA TCGCGGCCTG CGTTGTTGGC GGCGTTTCGT TTACGGGTGG TATTGGTAAG
ATCTCTGGTG TCACTACAGG TGTTCTTATC TTTACTGCAC TGACTTACGC TTTGACAATT
CTTGGTATTG ATACCAACCT TCAGTTTGTC TTCTCGGGCG TCATCATTCT GACTGCTGTC
ACCCTTGACT GCTTGAAGTA CGTTCAGAAG AAGTAG
 
Protein sequence
MPKTGGSVLT AEQEKELLKP IDQKIGSIQA QIDELRANGT NKVISTLSAI ESTKRDKSIS 
AEERTTLIEG YKTELELAKK VESENNAQVS KLIAEAEAYL KEHYKSEYLE PVKASCAVEK
TQAKQNYEAA LDRLKKEHEE AVRKTSDAQE IKDEKYVYKN RQFDAKVNYQ KDLQRIKDRA
HNAFSHEYHL IDLLRMSKFT PLESQAQKWE NYKYTFNTRS FLLQNGLYIV ILLVFIALCI
ITPAVKGTQL LTYSNVINIL QQASPRMFLA LGVAGLILLT GTDLSIGRMV GMGMTASTII
MHQGINTGQV FGITFDLTGV PIPVRIIMAL VTCIVLCTCF TSIAGFFTAK FKMHPFISTM
ANMLIIFGIV TYATKGVSFG AIEPSIPDMV IPRIGKFPSI ILWAIAAIAI VWFIWNKTTF
GKNLYAVGGN PEAAAVSGIS VFRVMVGAFV MAGILYGFGS WLECMRMVGS GSAAYGQGWD
MDAIAACVVG GVSFTGGIGK ISGVTTGVLI FTALTYALTI LGIDTNLQFV FSGVIILTAV
TLDCLKYVQK K