Gene Apar_0320 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0320 
Symbol 
ID8413168 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp366605 
End bp367573 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content52% 
IMG OID645021887 
ProductROK family protein 
Protein accessionYP_003179342 
Protein GI257784125 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID[TIGR00744] ROK family protein (putative glucokinase) 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0261245 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATACG TATTGGGCAT TGATGTTGGT GGTACCACCA TTAAACTGGG ACTTTTCTCC 
ACAGAAGGAG AGCTGCTTTC TGAGCAGAAG GTCAAGACGC CTGCACTTGA TAACGAGGAC
GGTTATCAGA CGGTAACCGA TGCAATTAGG CTTATTGTTC ATGGTCAAAA AGCAAGCCGC
AATGATGTTA TTGCGTGTGG TTTGGATATT CCAGGTCCTG TTGCAGATGA TGGAACCGTC
GGTTTTCTCG CTAATGTAGA CATTGACCCT GAGGGATTGG TACAGGCAAT TAATATGTGC
TTGCCAAACG CAACCATCGC GTTTGTTAAT GACGCAAACG CCGCGGCTTT GGGCGAAGCG
TGGGCTGGCG TTGCCGTGGG CGTGCCGTCG TTTGTGCTGA TTGCGTTGGG AACAGGTGTT
GGCGCAGGCG TTGTAGTAGA CGGTAAGCTT GCTGCAGGTG CTTTTGGCGC TGGTGGCGAG
ATTGGCCACA TTATTGTTGA GCCAGAAGAA ACTTTGACTT GTGGCTGCGG TCGTCATGGC
TGCCTGGAGC AGTACGCTTC CGCTAAGGGA GTTGTTCGCT TGTACCTGGA GGAATGCGCC
GCTCGTGGTG TTGTTCCTGT GAACATTGAG CACGAGACTG ATACCGTGTC CGTGTTTAGA
GCCCATGCTC AAGGAGATGA GTGCGCAACC CTTGCTATCC ACAAGATGTG TCACTACCTT
GGCCTTGCTA TGGCGCAGGT TTCGTGCGTG GTTGATCCTG CTATGTTTTT GATTGGCGGT
GGCGTAGCAG GCTCGTTTGC AACATTTGCG TTGGAGCTTC GCGAGACCTT TGAGCAGTAT
GCTCTACCGG TTAGCAAGGG CGCTCGTATT GAGGCCGCTA GCTTGGGTAA TCAGGCTGCA
ATGTATGGTT GCGCATATGA GGCGTTGCGT CTTAGAAAAG AACGCTTTGG CCAGGAGGAA
GCAGAGTAG
 
Protein sequence
MEYVLGIDVG GTTIKLGLFS TEGELLSEQK VKTPALDNED GYQTVTDAIR LIVHGQKASR 
NDVIACGLDI PGPVADDGTV GFLANVDIDP EGLVQAINMC LPNATIAFVN DANAAALGEA
WAGVAVGVPS FVLIALGTGV GAGVVVDGKL AAGAFGAGGE IGHIIVEPEE TLTCGCGRHG
CLEQYASAKG VVRLYLEECA ARGVVPVNIE HETDTVSVFR AHAQGDECAT LAIHKMCHYL
GLAMAQVSCV VDPAMFLIGG GVAGSFATFA LELRETFEQY ALPVSKGARI EAASLGNQAA
MYGCAYEALR LRKERFGQEE AE