Gene Apar_1005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1005 
Symbol 
ID8413877 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1134640 
End bp1135707 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content45% 
IMG OID645022594 
ProductdTDP-glucose 4,6-dehydratase 
Protein accessionYP_003180025 
Protein GI257784808 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1088] dTDP-D-glucose 4,6-dehydratase 
TIGRFAM ID[TIGR01181] dTDP-glucose 4,6-dehydratase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.610294 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGACCT ATTTAGTAAC TGGTGGAGCT GGATTTATTG GATCTAACTT TATCCACTAT 
ATGCTCAAAA GAAATCAGGA CATTCATATC TTGAATGTGG ATGCTCTGAC TTATGCGGGC
AACCTTGAGA ATCTTTCGGA ATATGATCAA GACCCTCGCT ACACCTTTGC TCACGTAGAC
ATTAGGGATA AAGAGGCTCT GACACAGCTG TTTGAGGCTC ATCATCCCGA TTATGTCATC
AACTTTGCTG CAGAGAGCCA CGTAGATCGC TCCATTGAAG ATCCAGGCGC CTTCGCCGAT
ACCAATGTCA TGGGTACTGT TGCTCTTTTG AGTGTTGCAG AGTCTTTCTG GAATGATGGT
CAGGGTTCTT ACGGAGACCA TAAGTATCTG CAGGTTTCTA CTGATGAGGT CTATGGCTCA
CTGTCACTTG ATGATCCAAA AGCATTTTTC CGTGAGACCA CATCTTTGAG TCCACACAGC
CCCTACTCTG CATCTAAGGC TTCTGCAGAT ATGTTTGTAA AAGCCTGGCA TGACACGTAC
GGATTTCCTG CGGTAATCAC GCGTTGCTCC AACAACTATG GCCCTTACCA GTTCCCCGAG
AAGCTCATTC CTCTGATGAT TGAGAATTGC TTAGAACATA AGTCTCTTCC TGTCTATGGT
GATGGACTCA ACGTTCGAGA TTGGCTCTAC GTTGATGATC ACTGTAAGGC AATTGCTATG
GTTCTTGAGG GCGGCAGACT GGGTGAGGTT TACAACATTG GTGGTCATAA CGAGCGCAAT
AACCTTTACA TTGTTAAGCG CATCATCAGT GAAGTTGCAA GAATCACTGG GGACACTCAG
ATTACTGAGG ATCTGATTTC TTACGTTACT GACCGCAAGG GTCATGACCG CCGTTACGGC
ATTGCACCTG ATAAGATTAA AGAAGAGTTG GGCTGGTATC CAGAAACTCC ATTTGAAGAG
GGAATTGTTA CCACTATCAA CTGGTATCTA GAGAACCGTA AGTGGGTTAA GAATGTAGTT
TCTGGTGATT ACCAGGATTA CTACAAGAAG ATGTATGAAG GTCGATAA
 
Protein sequence
MKTYLVTGGA GFIGSNFIHY MLKRNQDIHI LNVDALTYAG NLENLSEYDQ DPRYTFAHVD 
IRDKEALTQL FEAHHPDYVI NFAAESHVDR SIEDPGAFAD TNVMGTVALL SVAESFWNDG
QGSYGDHKYL QVSTDEVYGS LSLDDPKAFF RETTSLSPHS PYSASKASAD MFVKAWHDTY
GFPAVITRCS NNYGPYQFPE KLIPLMIENC LEHKSLPVYG DGLNVRDWLY VDDHCKAIAM
VLEGGRLGEV YNIGGHNERN NLYIVKRIIS EVARITGDTQ ITEDLISYVT DRKGHDRRYG
IAPDKIKEEL GWYPETPFEE GIVTTINWYL ENRKWVKNVV SGDYQDYYKK MYEGR