Gene Apar_1246 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1246 
Symbol 
ID8414125 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1395274 
End bp1396899 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content55% 
IMG OID645022838 
Productcarbohydrate kinase, YjeF related protein 
Protein accessionYP_003180262 
Protein GI257785045 
COG category[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCCGG TATTAAATGT TGAGGACATC AAGCGCGTAG AGATTGCGCT GACACGCGTG 
GGCGTAAGCG TTTCTGAGCT CATGCACCGT GCAGGTTACG CTGCTGCTCA AGAAGCCCTT
GGCATGGGGG GAGACATTAG TAACGTTGTC ATCCTCGTAG GTCTTGGCAA TAACGGTGGA
GATGGCTGGG TGGCAGCAGA AGCGCTGCGC TCTAGGAACT GCAACGTTAA GGTGGTTACT
CCACTTGAGC CAGATCAGAT TTCCGGCGAT CTTGCACGTC AAATGGCGCA GCGCGCCGTT
CGCGCGGGAG TTTCGGTGCT TGTTGGTCCT TCTCGCCAGG AGCTTATTGA TCTGTTGGCA
ACAGCTGATG TGGTGCTTGA CTGCATGTTG GGTACCGGTT TTCACGGCAA AGTGAGAGCT
CCGTTTGATA TTTGGATTGA GTGCCTTAAT CAGTCTGGCG CTCGCGTGCT CTCTGTTGAC
GTTCCAAGTG GTCTTTCTGC GCAGAAGGGC CAGGTAGAAA GTGCGTGTGT TGTTGCTGAC
GTCACCGTTA CCATGATTGC GCTGAAGCCC GGTCTAATTG CTGATGCTGG TCGAGATGTT
TGTGGTTCTA TTGTTGTAGC TCCTTTGGCC GAGCAGACGG AGCGCTTGGT TGTTGAAGCA
GATCCTGTTG CATGGCGCGT TGACTTGGAA GATTATCTTG CTTCTGTTCC TGCTCAGCTC
AACGACTGTG ATAAGTATTC TCGCGGCTCC GTACTGGTGG TTGGTGGCTC TAGCCGCTTC
CCGGGAGCCG CTGTTTTTGC TGCTAAGGCG GCTGCTCGTG CGGGTGCCGG CTACGTTACG
CTGGCAGTCC CCGAGGCTAT TGTCAGCTGT GTTCAGATGA TGTTGCCAGA GGTTCCTGTT
ATTGGCCTCC CATGCGATGC TGAGGGCGTG TTTACTGAGG AGGCCGCTCC ACTGGTGCTG
CAGCTTGCTG CGATGCGTAC CGTTACGCTG GTTGGTCCTG GCATGCGCGT TTCTGGTGGA
ACCGTTAAGG TTACCTCTGC ACTGCTTGAC TCTGAGCTGC CCGTTATTGT TGACGCTGAT
GCACTCAACT GTATTGCTCG CCTGACTAAC AACAACCTTC CAGATTTCCC CGAGCTCACT
CGTCGTACCG CTCCGCTTAT CATGACCCCA CATCGTCGTG AGCTTGGCCG CTTGGTCAAT
CAGGTAGACA ATCCTCCTGC AAGCCTGGTG GCTCAGCTTG AGGCAGCTCG TAAGATTGTC
TGGGCAGATG GTGGCTCTGA GCTGGTTATT GTTGCTAAGG GTACTGCTAC CGGCTGTGTA
GGCGTCCAGA AAGCCGTGCT GCCAAAGCCT GGCCCTGTCA CGTTGGCAAC CGCTGGTTCT
GGCGATGTTC TTGCTGGTAC CATTGCTGGC CGATTAGCTC AGGTTGCCGG CCAAGTTGAT
GACCTGACAA TCTTCTGCTC GCTTGCTTGT GAGGTTCATG CATACGCAGG CCAGCTAGCT
GCCGAGAAGT TCGGTGTACG TGGAGCTATG GCTGGTGACA TCTGCGATGT TATTGGCCTT
GCTTCAGATG CACTTGAGGA GCAAATTGCG TTCCCTATGG CAGACTTTGA AGAAGCTGCA
GAGTAG
 
Protein sequence
MQPVLNVEDI KRVEIALTRV GVSVSELMHR AGYAAAQEAL GMGGDISNVV ILVGLGNNGG 
DGWVAAEALR SRNCNVKVVT PLEPDQISGD LARQMAQRAV RAGVSVLVGP SRQELIDLLA
TADVVLDCML GTGFHGKVRA PFDIWIECLN QSGARVLSVD VPSGLSAQKG QVESACVVAD
VTVTMIALKP GLIADAGRDV CGSIVVAPLA EQTERLVVEA DPVAWRVDLE DYLASVPAQL
NDCDKYSRGS VLVVGGSSRF PGAAVFAAKA AARAGAGYVT LAVPEAIVSC VQMMLPEVPV
IGLPCDAEGV FTEEAAPLVL QLAAMRTVTL VGPGMRVSGG TVKVTSALLD SELPVIVDAD
ALNCIARLTN NNLPDFPELT RRTAPLIMTP HRRELGRLVN QVDNPPASLV AQLEAARKIV
WADGGSELVI VAKGTATGCV GVQKAVLPKP GPVTLATAGS GDVLAGTIAG RLAQVAGQVD
DLTIFCSLAC EVHAYAGQLA AEKFGVRGAM AGDICDVIGL ASDALEEQIA FPMADFEEAA
E