Gene Aasi_1064 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1064 
Symbol 
ID6377165 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1376518 
End bp1377531 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content37% 
IMG OID642682177 
Producthypothetical protein 
Protein accessionYP_001958138 
Protein GI189502421 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1088] dTDP-D-glucose 4,6-dehydratase 
TIGRFAM ID[TIGR01181] dTDP-glucose 4,6-dehydratase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.732456 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAATA TATTAGTAAC TGGCGGAGCA GGTTTTATAG GTGCTAATTT TATACCCTAT 
TTTTTAAACA AGTACCCAGA ATATGAAATA GTTAATCTAG ATAAGCTTAC GTATGCTGGC
AACTTGAATA ATTTAACAGA AGTGCATTCA AATCCCCGTT ACCACTTCGT GCAAGGTGAT
ATTACCAACA GAGAGTTAGT ATCATCTTTG TTTAGGCAAT TTGACTTTCA AGGAATTATT
CACTTAGCAG CAGAGTCACA TGTAGACCGT TCTATTCAAG ATCCTACCTT ATTTATTAAA
ACCAATATAG AAGGAACGTT TGTTCTGTTA GAGGCAGCCC GTCTGCATTG GATGCAAAAA
CCTGGGGAAT ATAAACAAGA TTACATAGAA AGTCGCTTTT TACACGTATC TACAGATGAG
GTATATGGTA GCTTAGGGCC TGCTGGTTTT TTTACAGAAG AAACCCCGTA TGCACCTAAC
AATCCTTATA GTGCTACCAA AGCAGGCAGC GACCTGCTAG TGCGTAGCTA TGTACATACT
TATGGGTTTA ATGCCATAAC TACCCATGCT TCCAACAATT ATGGTCCCAA ACAATACCCC
GAGAAACTTA TTCCTATTAT TATTCAACGT GCGCTAGCAC AACAACCTAT TCCTATACAT
GGCAAAGGAA ATGCTGTTAG AGATTGGATT TATGTACTAG ATCATTGTAA AGGTATTGAT
TTAACCTTTC ATTATGGACA AATCGGAGAG CATTACAATT TTGGAGGTAA CCATGAGCAA
AACAACCTAC AAATAGCTTA TCAGGTATGT GCTTTGCTAG ATAAACTAGC ACCACTGTCC
AATAGAAGTT CTTATCAATC ACTCATTACT TTTGTAACAG ATAGGCCAGG CAATGATCAA
CGGTATGCGT TAGCCACCCA AAAAGCTGAA AAAACTTTAG GCTGGAAAGC AGAAGAACCT
TTTGAGACAG GATTGCAAAA AACTGTACAA TGGTACTTAA AAAATAAATT ATAA
 
Protein sequence
MKNILVTGGA GFIGANFIPY FLNKYPEYEI VNLDKLTYAG NLNNLTEVHS NPRYHFVQGD 
ITNRELVSSL FRQFDFQGII HLAAESHVDR SIQDPTLFIK TNIEGTFVLL EAARLHWMQK
PGEYKQDYIE SRFLHVSTDE VYGSLGPAGF FTEETPYAPN NPYSATKAGS DLLVRSYVHT
YGFNAITTHA SNNYGPKQYP EKLIPIIIQR ALAQQPIPIH GKGNAVRDWI YVLDHCKGID
LTFHYGQIGE HYNFGGNHEQ NNLQIAYQVC ALLDKLAPLS NRSSYQSLIT FVTDRPGNDQ
RYALATQKAE KTLGWKAEEP FETGLQKTVQ WYLKNKL