Gene Aasi_1096 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1096 
Symbol 
ID6376290 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1410456 
End bp1411658 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content38% 
IMG OID642682208 
Producthypothetical protein 
Protein accessionYP_001958168 
Protein GI189502451 
COG category[R] General function prediction only 
COG ID[COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase 
TIGRFAM ID[TIGR01891] amidohydrolase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.081293 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACAGT TAAGAGATCG TGTTCAACAA CTTGCACAAG CCAATCAAGC TCAAATAGTT 
GCCATTAGAA GGCATTTACA TGAAAACCCA GAACTCTCTT TTCAAGAATT TAATACTGCA
AAATTTATAG CAAAGACATT AAGAGAATTT GGATTTGAAG TACAAGAAGG CATTGCCAAT
ACAGGCCTTG TAGTAGTTAT TAAAGGCAAA AATCCTTCTA AAAGAACTAT TGCACTACGT
GGTGATATAG ATGCATTGCC TATACAGGAA GAAAATACAG TTTCTTATAA ATCTAAAGTC
GAGGGTGTCA TGCATGCCTG CGGACATGAT GTACATACAT CCTCTTTAAT TGGTACAGCA
CTTATCCTTC ATAGCCTACA AGCAGAGTTT GAGGGTACTG TTAAGCTTAT ATTTCAGCCA
GCAGAAGAAA AAGCACCTGG CGGCGCTATA AACATGATTA AAGAAGGAGT GCTCCAAAAT
CCAGCACCTG CTATCATATT GGGCCAACAT GTATGCCCTA TTATACCTAT AGGAAAAGTA
GGTTTTACTA AGGGCACAGT AATGGCGAGT GCTGATGAAA TATATATTAC TGTAAAAGGC
AAAGGTGGCC ATGCAGCTTC TCCACATGCT GCGGTTGACC CTATCCTCAT TGCCTCTCAT
ATTATTGTAG CCTTACAACA AATTGTAAGT AGGAACACAG ATCCCTTAAA ACCTTGTGTA
TTATCTATCT GCCAAATTAA AGCTGGAGAA GCAACCAACG TTATTCCTGA AATAGTAAAT
TTATCAGGAA CAATCCGTAC CGTAAGTGAA GAATGGCGCA AAGAAGCACA CAAAAAGATA
ACCCATCTTT GCCAGAGTAT AGCTGAAGGT ATGGGAGGTA CTTGTGAAGT CAACATAGGG
CAAGGCTATC CACCTACTTA CAATCATCCT GTAATGACCG AAAGAACATT CGAAGCAGCA
TGTAATTATA TGGGACATGA TAATGTACAT TATATGGATA TGAATATGGG AGGAGAAGAT
TTTGCCTATT ATGCTCAACA GATACCTGGC TGCTTTTATA TGATAGGCAT ACAAAATATA
GATAAGGGTA TTAATTCCTT TGTACATACG CCAACATTTG ATGTAGATGA GAAGGTTTTA
GAGATAGCAC CAGGACTTAT GGCATGGTTA GCGTTACATG AGTTAGCTGT AAGCGAAGAC
TAA
 
Protein sequence
MKQLRDRVQQ LAQANQAQIV AIRRHLHENP ELSFQEFNTA KFIAKTLREF GFEVQEGIAN 
TGLVVVIKGK NPSKRTIALR GDIDALPIQE ENTVSYKSKV EGVMHACGHD VHTSSLIGTA
LILHSLQAEF EGTVKLIFQP AEEKAPGGAI NMIKEGVLQN PAPAIILGQH VCPIIPIGKV
GFTKGTVMAS ADEIYITVKG KGGHAASPHA AVDPILIASH IIVALQQIVS RNTDPLKPCV
LSICQIKAGE ATNVIPEIVN LSGTIRTVSE EWRKEAHKKI THLCQSIAEG MGGTCEVNIG
QGYPPTYNHP VMTERTFEAA CNYMGHDNVH YMDMNMGGED FAYYAQQIPG CFYMIGIQNI
DKGINSFVHT PTFDVDEKVL EIAPGLMAWL ALHELAVSED