Gene Aasi_0846 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0846 
Symbol 
ID6377177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1070101 
End bp1071720 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content38% 
IMG OID642681985 
Producthypothetical protein 
Protein accessionYP_001957946 
Protein GI189502229 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0557159 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAATA TAAATGCATT AGCAGATCTT CCCTCGAGCA AGCACTCACT CAATGAGCTG 
GAAAAGCTAT CTGTACAGGT TGAAGCTGCC CTGGATAAGG AGAGGTATAA TGCAGTCGTA
GGTGAAATCA AGCAAGTGTT GGATACAAGA ATAGAAACTA TTAACAGGCT ACAAGAGGAA
GAAGCAGACC CAGTTATTAT CATAGCAACG GCACGCGAGA TAAAGCCCCA TGTAGAAGCA
GCTATAAAAC ACTTGGAGGA GTTTTCGACC AAGCAAGATA AGAGTTCTAG TATAGAGAAC
TTGAAAGAAG CCAAAAGTCA GCTTGAAGGG GTGATTAGAG ATTATACTCA CAGGAAAGTT
TTATGGGATA AGTACAAGTC AGCTTTTAGC TATGCAGCCA AGCATGTTTC AAAAAGGATA
GAGCAAGAGC AAGAAGAATT AGCTAAGAGC ATAGGAAAGT CAGTCGAAAA GCCTGAAATA
ACTATAACCA ATAAGTCAGG TATAACCATA GCGCAACTAG AGGAAATTTT TCCAGAAGAC
TCCAGCAATA ACTTCAATTT GCAAAAACTG ATAGATACTT TAGATATAAC TGTTAATTTC
CCAGCAGCAG AAGAGAATGC TAACAGTAGC GAGGAGGATG AAGAAGAAGA AAGGCAAGGT
GTAGAGGAGC ATTTAGAAAG AGAAGCTAAG TACGAGAAAA CAGTAAGCCA AAAGGAAGAA
CAAGAAAGGG CTAGTAGCCA AGCAGGTATA AATGAGAAAG AGAAAGAATC AGCAGCAAAA
AGGCAAGCCA GAAAAGAACA AAGAAGAGCT AACCAAAAAG CCAGAAGAAA AGAAGCATCT
CCCAAAAAAG CCCATGAAAA AGAGCTGATA AAAATAGGAC AAGCTAAAGA GCAACTTAAA
AGCTTGGTTA AAGATCAAGT GAAATCTTGT GTAGAGGATG CTGAATTTAC TAATTGTAGA
GTAGTATTAT TGATGATCGA TGAGAAGCTC AAAAAAGAAC CCAATAATAA GTCTTTACAA
GCGTTAAGGG TAAAACAGCT AGAGAAATTC CAAAGCCACC CGCTTTACAG CCAAGTTATA
GCGTTAGGAA TAGAAGTTAT GGCTCTATGC CAGCAAGCAG AGTTAAAAGC TGACGAGAGA
CATATTAGGG CTACAATATT ACCGCATATT ATACTACCAT TAGAGATAAA TCAGTCAAGA
GAATTTCGTC AAGAATCAGA AGAAATTGAA GCAACTATTC TTAACTCTTT ACGAGATCTA
TTATGCTGTT CGCACAGACC AAGCGGGTTA AGAAGGACAG CAGAAGGTAT AAAAGGATTT
GTTGAAAGTT TTTATGATAC AGGAAAACTA TTTTTAGTAG ATGTACCTGT CTTATTAGAT
AATTTAGGGA TGAAAGGAAT TGAATCATTG GTAAAGTGGG AAGATAAGCT AGGTATGAAA
CAGGGCTGGG CTAACTTAGG AGAATTTTGT AAAAAATTGT TTGATGAAAC AAAACAAGCC
TGGGAAGACC CAGAGGCTTA TAAAGCTCGT ATGCAGGCTG CTCATGAAAT AAAAAGGAGG
GAGGCACTTT ACCGTCGTGC ATTTGATGCA GAGAATTACC ATTTAGGGCA AGATAAGTAG
 
Protein sequence
MKNINALADL PSSKHSLNEL EKLSVQVEAA LDKERYNAVV GEIKQVLDTR IETINRLQEE 
EADPVIIIAT AREIKPHVEA AIKHLEEFST KQDKSSSIEN LKEAKSQLEG VIRDYTHRKV
LWDKYKSAFS YAAKHVSKRI EQEQEELAKS IGKSVEKPEI TITNKSGITI AQLEEIFPED
SSNNFNLQKL IDTLDITVNF PAAEENANSS EEDEEEERQG VEEHLEREAK YEKTVSQKEE
QERASSQAGI NEKEKESAAK RQARKEQRRA NQKARRKEAS PKKAHEKELI KIGQAKEQLK
SLVKDQVKSC VEDAEFTNCR VVLLMIDEKL KKEPNNKSLQ ALRVKQLEKF QSHPLYSQVI
ALGIEVMALC QQAELKADER HIRATILPHI ILPLEINQSR EFRQESEEIE ATILNSLRDL
LCCSHRPSGL RRTAEGIKGF VESFYDTGKL FLVDVPVLLD NLGMKGIESL VKWEDKLGMK
QGWANLGEFC KKLFDETKQA WEDPEAYKAR MQAAHEIKRR EALYRRAFDA ENYHLGQDK