Gene Apar_0933 details

Gene Information

Locus tag: Apar_0933
Symbol:
ID: 8413804
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 1046002
End bp: 1048041
Gene Length: 2040 bp
Protein Length: 679 aa
Translation table: 11
GC content: 49%
IMG OID: 645022521
Product: ABC transporter related
Protein accession: YP_003179953
Protein GI: 257784736
COG category: [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism
COG ID: [COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component
TIGRFAM ID:
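
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that does not count toward the protein length. The function names are hypothetical and exist only for this illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose last codon is assumed to be a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Gene Information fields above.
length = gene_length_bp(1046002, 1048041)
print(length, protein_length_aa(length))  # 2040 679
```

Under these assumptions, 1048041 - 1046002 + 1 gives 2040 bp, and 2040 / 3 - 1 gives 679 aa, matching the Gene Length and Protein Length fields.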


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.311537
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTACAAA TTAGAAAAAA ACAAGTCGAG AAGCTCGAAG AAGCTGCAGC AAAGGGCTCA 
ACCTTTGGTG GCTTTAAGAA GATGAAGACG GCTTCCAAGA TTGCTCTTGT CATTCTTGTC
TTTGTTGTTC TTGCTTCTGT CTTAGCAAAC GTTATTGCTC CACATGACCC ACTTGAGATT
TTCAACGCTC GTCAGGCTCC AGGTAATGGA TTCATTTTTG GTACTGACGA TAAGGGCCGC
GACATTCTTT CTCGTATGCT CCATGGTGGT CAATATTCAC TGGTCATTGG TTTTGGTGCT
ACGGGTATGG CACTGCTATG TGGCTCAATT ATTGGTGCAA TTGCAGCTGT TTCCAGAAAA
GCTGTTTCTG AGGTTATCAT GCGTGTGCTC GACATCATCA TGTCGTTCCC AGGCATTGCT
CTTGCAGCAG TCTTTGTTTC TATTCTGGGT AACAGCGTTC CTTCAATCAT TTTTGCCATT
GGCTTTATGT ATACGCCACA GATTGCACGT ATTGTTCGCG CAAATATCGT CAGCGAGTAC
GGCGAAGATT ACGTAAGAGC AACCATTGTT TCTGGTGCTC GTGCACCGTG GATTTTGTGG
AAACATGTTC TTCGCAACTG CATTGCTCCA ATCATGGTCT TTACCGTTAC GCTTGTTGCA
GACGCAATCA TCTTTGAGGC ATCTTTGACC TTCATTGGTG CAGGTATTGC AGAGCCAACT
CCTACATGGG GTAACATCCT TGCTGATGCT CGTGCCGGCG TTCTTGCTGG TCGTTGGTGG
CAGGCATTCT TCCCAGGCTT GGCCATTATG ATGACCTGCT TGGCACTGAA CATCCTCTCC
GAGGGTCTTA CTGACGCTAT GGCAGCAGCG CCTGGCGCTC CTGTTGATAC AGAGAACTCT
GACTCCAGGC GCGCAGATGA CATTCTTGCA TCTGATCCAG TTCGTGCATA TGCGGAGCAA
GCAGAGTCGC TGGAGCGCAG GCTTAATGCA CTCAAAGAGG TTGAGCTGTC CAGAACTGAT
AGGCGTAAGC CAGACTTTGA TGTAAAGCCA TTACTGAGCG TTAAGGATCT TTGCATTAGC
TTTGAGTCTC ACGGAGACGT TAAGGTTGTT GACCACGTAA GCTTTGATGT TCGCCCTGGT
CAGTGCATGG CGCTGGTTGG CGAGTCTGGC TGCGGCAAGT CTATTACCAC TAAGGTCATC
ATGGGTTTGA CTGATCCAAA AGAGACCATT ACTGGTGAGG TCCTCTACAA GGACCAAGAC
CTCCTGAAGC TTTCCAAAGA AGAGCACCGC AAGCTGCTTG GACATGAGCT TGCAATGGTG
TACCAGGACG CTTTGTCTTC CCTGAACCCA TCTATGCTTA TCTCTTCTCA GATGAAGCAG
CTTACCAGCA GGGGAGGTAC ACGTTCTGCA GAGGAGCTTC TGGAGCTTGT AGGTCTGGAT
CCAAAGCGTA CGCTTGAGTC CTATCCTCAT GAGCTTTCCG GTGGTCAGCG TCAGCGTGTT
CTGATTGCTA TGGCACTTAC CCGCGATCCT TCGCTGGTTA TCTGTGATGA GCCAACAACC
GCTTTGGACG TTACTGTACA GAAGCAGGTT ATCAAGCTCT TAAATGACCT CCAGCGTCGT
CTTGGTTTTG CAATGATTTT TGTTAGTCAT GACCTTGCAC TTGTTGCAGA GGTTGCTTCT
GAGATTACTG TTATGTATGC CGGTCAGGTT ATTGAGCAGG CGGCTACTAC TGAGCTTTTG
ACCAACCCTG TTCACGAGTA CACCCGTGGT CTTCTCGGCT CTGTTCTTTC TATTGAGGAA
GGCGCGCAGG ACGGTACCAG GCTTCACCAG GTACCAGGTT CTGTTCCTTC TCCAGAAGAC
TTTCCAACAG GCGATAGGTT TGCTCCTCGT TCAAGCCATC CAGATCTGGG TCTTGAGGTA
CATCCTGTTA TTAAGGAGAT TCCTGGTAGA CATCATCGTT TCTCCGAGCT GCCTGATGAG
TATTTGAAGG AGCATGGTCT TGTGCCTTAC CTTGAGCGTT CTCAAAAGGA GGTGAGGTAA
 
Protein sequence
MVQIRKKQVE KLEEAAAKGS TFGGFKKMKT ASKIALVILV FVVLASVLAN VIAPHDPLEI 
FNARQAPGNG FIFGTDDKGR DILSRMLHGG QYSLVIGFGA TGMALLCGSI IGAIAAVSRK
AVSEVIMRVL DIIMSFPGIA LAAVFVSILG NSVPSIIFAI GFMYTPQIAR IVRANIVSEY
GEDYVRATIV SGARAPWILW KHVLRNCIAP IMVFTVTLVA DAIIFEASLT FIGAGIAEPT
PTWGNILADA RAGVLAGRWW QAFFPGLAIM MTCLALNILS EGLTDAMAAA PGAPVDTENS
DSRRADDILA SDPVRAYAEQ AESLERRLNA LKEVELSRTD RRKPDFDVKP LLSVKDLCIS
FESHGDVKVV DHVSFDVRPG QCMALVGESG CGKSITTKVI MGLTDPKETI TGEVLYKDQD
LLKLSKEEHR KLLGHELAMV YQDALSSLNP SMLISSQMKQ LTSRGGTRSA EELLELVGLD
PKRTLESYPH ELSGGQRQRV LIAMALTRDP SLVICDEPTT ALDVTVQKQV IKLLNDLQRR
LGFAMIFVSH DLALVAEVAS EITVMYAGQV IEQAATTELL TNPVHEYTRG LLGSVLSIEE
GAQDGTRLHQ VPGSVPSPED FPTGDRFAPR SSHPDLGLEV HPVIKEIPGR HHRFSELPDE
YLKEHGLVPY LERSQKEVR
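
As a rough illustration of how the protein sequence relates to the gene sequence, the sketch below translates the first three 10-base blocks of the gene with a codon table built from the standard NCBI amino acid ordering, which translation table 11 shares with the standard code for internal codons. The helper name `translate` is hypothetical, and the code ignores the alternative start codons that table 11 permits, which is harmless here because this gene begins with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# The amino acid assignments are the same for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, in the blocked layout used above.
first_blocks = "ATGGTACAAA TTAGAAAAAA ACAAGTCGAG"
print(translate(first_blocks))  # MVQIRKKQVE
```

The output matches the first ten residues of the protein sequence. Applying the same function to the last twelve bases (GAGGTGAGGTAA) gives EVR, with the terminal TAA read as the stop codon.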