Gene Aasi_1842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1842 
Symbol 
ID8999539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1480110 
End bp1481306 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content36% 
IMG OID 
Producttransposase 
Protein accessionYP_003573215 
Protein GI294661339 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTTATA GGTGGTTTTG TAAGCTTGAG TTAGAAGACA AGGTGCCTGA TCACTCTTAC 
TTAAGTAAAA CAAGAAACAG ATTTGGTGAA AGAATATTTG AAGCGCTCTT TACTACTATA
CTGAATTTGT GTCGAAAGCA TGGCTTATTG GGGTCTAATA TTATGATGAC AGATAGCACA
TTAATTAAAG CTAATGCATC ACTAAATTCA TTAACTCCTA TCGAAGCTAA AGCAGCAGTT
GATGAGAAAG CAGCAAGAAA AGCTGCTATA AATAAACTTG GACCTCCTGT CAGTAGGAGT
ATAAGCAACA GCACTCATAT TAGTAAAACA GATAAAGACG TAAGCCTGGC TCAAAAGGAA
GGCAGTCCTC GGGAGTTGAA ATATAAAGTT CACAACTCTA TTGATGCAAC AAGCAGAGTA
ATAATTGATA CAAAGGTTAC AACGGGCAAG ACCCACGAAT GTGGTGTATA TGGTGTATAT
ATAGAGAGAA TCAAGTATAT TATTCAAAAG CACAAGCTAA ACATAAAAGA AGTTGTAGCA
GACAGGGGTT ACGGTTCAAG AGAGATAATA GAGACTTTAC AGAAAGAAGT TATAGTAAGC
TATATACCAC TCTTTAGTAC TAAGAGTGGT AGGACCATAC AGGAAGCTTA CAGCGCTGGG
TTTGTCTATC AAAAAGAACA GGATCGCTTT ATATATCCTG AAGACCAGTA TTTGAACCCT
TATGGCTTTC TGAATGGGGA GAGTAAATAT TACAGGTCAA AATCATCCAT TTGTGCTATA
TGCAAGCAAA AAGATGCTTG TATTGCTTCA GCTAAGAAAA GCAGGCCATT CACAAAATAT
CTTATTAGAA GCATACATCA AGAGTTGTTT GATAAGACTT TTGAAGCTAT GCAAGAGTCG
GCATTGATTG GTAAGCTTAA AGAAAGGATG TGGAAAATAG AAGGTATTTT CGCTGAAGCT
AAGCAGTTAC ATGGGTTAGG TAAAGCTCGT TATAGGAGGT TGGAAAGAGT GCAAATACAA
GCATATATGG TAGCTGTTGT ACAAAATATT AAAAGAATAA TTAAGCAGCT TTTTTATGTC
TTTCTTTATT TTCTTAACAT GCTTAATAAA ATATTTTTAA CATATACTTT TTCAACAGCC
CCAGTCTTCT TGCATACCTC TAAATACATT AGACCCGCTG CGAAACAACT GCCGTAA
 
Protein sequence
MAYRWFCKLE LEDKVPDHSY LSKTRNRFGE RIFEALFTTI LNLCRKHGLL GSNIMMTDST 
LIKANASLNS LTPIEAKAAV DEKAARKAAI NKLGPPVSRS ISNSTHISKT DKDVSLAQKE
GSPRELKYKV HNSIDATSRV IIDTKVTTGK THECGVYGVY IERIKYIIQK HKLNIKEVVA
DRGYGSREII ETLQKEVIVS YIPLFSTKSG RTIQEAYSAG FVYQKEQDRF IYPEDQYLNP
YGFLNGESKY YRSKSSICAI CKQKDACIAS AKKSRPFTKY LIRSIHQELF DKTFEAMQES
ALIGKLKERM WKIEGIFAEA KQLHGLGKAR YRRLERVQIQ AYMVAVVQNI KRIIKQLFYV
FLYFLNMLNK IFLTYTFSTA PVFLHTSKYI RPAAKQLP