Gene Apar_0014 details


Gene Information

Locus tag: Apar_0014
Symbol:
ID: 8412854
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 18834
End bp: 20114
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 47%
IMG OID: 645021581
Product: hypothetical protein
Protein accession: YP_003179044
Protein GI: 257783827
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4573] Predicted tagatose 6-phosphate kinase
TIGRFAM ID:
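
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive and that the reported gene length includes the stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 18834
end_bp = 20114
gene_length_bp = 1281
protein_length_aa = 426

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one untranslated stop codon,
# so the protein has one fewer residue than the number of codons.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under those two assumptions, all three checks agree with the values reported in the table.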


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.109361
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAAAGC TACCAATCAA AAAAGCTGTT GAAGGGTTGC TTAAACTTCA GGACACAGGA 
AGGTCCGCCA CGCTTTTAGG AATTGGACCA ATGTCGCCCA ACTTGCTTCA GGCAGCTTTT
GAACTTGGTA GAGATTGCGA TTTTCCTCTA ATGTTCATTG CATCTCGAAA CCAGGTAGAC
CTTGACGAGC TTGGTGGAGG ATACGTAAAC GCTTGGGATC AGAAGCGCTT CTCGGAAGAT
ATTGCTGAAG CAGCTCAGAA GGTTGGTTTT GACGGTCTGT ATTATCTCTG CCGTGACCAC
GGTGGTCCTT GGCAGCGCGA CGAGGAGCGC AATGCTCACC TTCCAGAAGA TGAGGCCATG
GAGCTTGCTA AGAAGTCTTA TCTTGCCGAC ATGCTTAATG GCTTTGACCT GCTGATGATT
GATCCAACCA AGGATCCCTT TGAGATTGGT AAGGTCATTC CGCTAGATGT GGTTCTTCGT
CGTACGGTTG ATTTGATTGA GTGGTGCGAG AAGGAACGTG TTTCTCGCGG TCTTCCCGAG
ATTGGCTATG AAGTTGGTAC CGAGGAAACA AACGGTGGCT TGACCTCAAC CGATAAATAC
CACACCTTTA TTGAGCAGCT TAAGAGTGAG CTGACTGCCA AGGGTTTGCC TATGCCAACT
TTTATTGTGG GACAGACGGG AACGCTCACC CGTCTTACTG AGCAGGTTGG CCATTACGAT
TTTGAGGCTG CATTTAGCTT GTCTAAGATG GCCAAGAGCT ACGGCGTTGG TCTTAAGGAG
CACAATGCAG ACTATCTTGA CGACGTAACA CTACTTGAAC ACACTCCAGC AAACGTTACC
GCTTCAAACG TAGCTCCACA ATATGGAACG GAAGAGACTC GTGCATATCT GAAACTTTGC
GATGTTGAAG ATCTTTTGGT TAAAGAAGGT CTATTGAAGT CGGATGAAGT TTCTGGCTTG
AGGAATACCC TGTTAGTAAA GGCAATTGAG ACTGAACGCT GGCGTAAGTG GATGGTAGGT
AATCAAGTTA ATCTGACCGT TGAGCAGATT CTTGCTGATC ACAAACTCTC ACTAGATATT
CTTGATATTT CCGGTCACTA TGCGTTCAAT GATGACGAGG TCAAAGCTGC AACTGAGCAC
CTGTATAAGA ACCTTGCCCA GTTCAATATT GATGGTCAGC GCTTTGTGGT TGATCACATT
AAGCGCCCTC TTCGCCAGTA CGTTGAATGC TACAGGCTTG AAGGAGTTAC TACGCGTATT
CGCGAGGCGC TGGCAGAGTA G
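
The gene sequence above is printed in blocks of 10 bases, so it needs to be joined into one string before its length or GC content can be computed. The following is a minimal sketch of that step. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the last line of the sequence is used as a small sample.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Last line of the gene sequence, used here as a short sample.
last_line = "CGCGAGGCGC TGGCAGAGTA G"
tail = clean_sequence(last_line)

print(len(tail))
print("ends with TAG stop codon:", tail.endswith("TAG"))
print(f"GC fraction of sample: {gc_fraction(tail):.2f}")
```

Passing the full gene sequence through the same two helpers should give a length and a GC fraction that can be compared with the Gene Length and GC content fields of the record.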
 
Protein sequence
MEKLPIKKAV EGLLKLQDTG RSATLLGIGP MSPNLLQAAF ELGRDCDFPL MFIASRNQVD 
LDELGGGYVN AWDQKRFSED IAEAAQKVGF DGLYYLCRDH GGPWQRDEER NAHLPEDEAM
ELAKKSYLAD MLNGFDLLMI DPTKDPFEIG KVIPLDVVLR RTVDLIEWCE KERVSRGLPE
IGYEVGTEET NGGLTSTDKY HTFIEQLKSE LTAKGLPMPT FIVGQTGTLT RLTEQVGHYD
FEAAFSLSKM AKSYGVGLKE HNADYLDDVT LLEHTPANVT ASNVAPQYGT EETRAYLKLC
DVEDLLVKEG LLKSDEVSGL RNTLLVKAIE TERWRKWMVG NQVNLTVEQI LADHKLSLDI
LDISGHYAFN DDEVKAATEH LYKNLAQFNI DGQRFVVDHI KRPLRQYVEC YRLEGVTTRI
REALAE
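
The protein sequence can be related back to the gene sequence by translating codons with translation table 11. The following is a minimal sketch, assuming the amino acid assignments of table 11 are the same as those of the standard code (the two tables differ in their permitted start codons, which this sketch does not model). The function name `translate` and the compact table layout are illustrative choices.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence.
first_bases = "ATGGAAAAGC TACCAATCAA AAAAGCTGTT"
last_bases = "CTGGCAGAGTAG"

print(translate(first_bases))
print(translate(last_bases))
```

The two fragments translate to the first ten residues and the last three residues of the protein sequence shown above, with the final TAG codon read as a stop.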