Gene Aasi_1197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1197 
Symbol 
ID6377379 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1531894 
End bp1533531 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content32% 
IMG OID642682298 
Producthypothetical protein 
Protein accessionYP_001958256 
Protein GI189502539 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1452] Organic solvent tolerance protein OstA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00397555 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATGCCTA CCGTATGCAT ACAAAAAAAT AAAGTTTTTA CAACCTGTCT TATATCCATT 
GTAACAATAA TTTGCTTCTA TGCAGTTTCT TTGGCTAAGA ACAAGGATAA AGTAAAGATA
AACTATAGTG CAGATAAACT AGAAGGTGGC GAAAAGGAAA ATGGAGAAGA ACCTTATAAA
AAACTTTCTG GACACGTAAT TTTTATTCAT GAAGATTTTA CCATTTATGC TGATTCTGCT
CAATATTATG ACCAAAAAGG AATTGTAAAA GCAGCTGGTA ATCTTCAGAT GATTGATAAG
GAAGGTGGAG TTATGGTTGC AGAAAGGGTT GTATATGATG TTAATACAAA GATAGCTCAG
CTACGCAATT CAGTATCTTA TGAGCAAGAT ACCCTTAAAT TTTATACTGA TGAGCTGGAT
TATGTGGTAA AGGATAAAAA AGGATATTTT AGAAATGGAG GTACTTTAAT TCAGGATAAT
GATCAAATAA GCAGCCAGAC CGGATATTAT GATGAAAAAA ATAAATTAGC TGTCTTTTCC
CATCAAGTGG AGCTTAGTAA TAAAGAATAT CATGTGGCAT GTGATATGCT ACGCTACCAT
ACTAATACTA AGTTAGCTGA GTTTAAGGGT AATACACACA TAGTCACTAA AGAAGGAGAA
ACTATTACTA CTAAGGAAGG TGGCCAATAT AATACAGATA CAAAAGACGC ATTGTTTAAA
AAGGCAAGAG TGGAATCTGA AAAGTACAGC TTGTATAGTA ATCTAATAAA AGCTAATCAA
GAGAAAAATC AGTATACAGC AACAGGACAA GTGGAGTTAG TTTCCAAAGA ACATTATGTA
ACTATTACAG GAGAGCATGG TTATTATGAT TATGATAAAG GAGTAGGAGA AGTTCTTGGT
AATCCTTTGT TACAGCGTAT CATAGAGGAA GATACTTTAT ATATGATAGC AGATACTTTT
AAAGCTATAC AAGACAAAGT TCATGACAAG GATGATGAAA AAGACCATGT GATTTTAGGT
TATAACAATG TAAAAATATA TAAATCAAAT CTACAAGCCA AAGCAGATTC TATGTCCTAC
CATAGCATAG ATTCTACTGT TTATTTTTAT AATAAACCTA TTTTTTGGAA TTATGATAGC
CAAATTACTG CCGAATCTAT TCGTATAGCA CTTAACAATG AGGCTATTGA GAAAATGTAT
ATGGATACAG ATGCTTTTAT AGCTTCTGTA GATAAATTTG ACAACTATAA TCAAGTAAAG
GGCAGAGAAA TGGTAGCGCA ATTTCAAGAT AATAAAATCA GCTATATAGA TATCTTAGGT
AATGGAGAGA GCCTCTATTT TGCACTTAAT GATAGTTCAG AGTTAGTAGG TATGAATTAC
ATTCGATGTA GCCATATACG TATTGATATG GATAATGAAA ACTTATCAAA AATTAGCTTT
CTTGTAAAAC CTACAGGTAT TTTTTATCCT GCTCACAAAA TTATGGAAGA TGAGAAGCAG
CTGTTTGGTT TTAAGTGGAG AGTAAATGAA AAACCTATAT TAGAAGAGTT TTTATTAAGA
AAACAAGTTG CTGCACAAAA AGCTGGTAAT AAAAGCGCAC AAAAAAACCA TATAAATTCA
AAGAAAGAAC TTAATTAA
 
Protein sequence
MMPTVCIQKN KVFTTCLISI VTIICFYAVS LAKNKDKVKI NYSADKLEGG EKENGEEPYK 
KLSGHVIFIH EDFTIYADSA QYYDQKGIVK AAGNLQMIDK EGGVMVAERV VYDVNTKIAQ
LRNSVSYEQD TLKFYTDELD YVVKDKKGYF RNGGTLIQDN DQISSQTGYY DEKNKLAVFS
HQVELSNKEY HVACDMLRYH TNTKLAEFKG NTHIVTKEGE TITTKEGGQY NTDTKDALFK
KARVESEKYS LYSNLIKANQ EKNQYTATGQ VELVSKEHYV TITGEHGYYD YDKGVGEVLG
NPLLQRIIEE DTLYMIADTF KAIQDKVHDK DDEKDHVILG YNNVKIYKSN LQAKADSMSY
HSIDSTVYFY NKPIFWNYDS QITAESIRIA LNNEAIEKMY MDTDAFIASV DKFDNYNQVK
GREMVAQFQD NKISYIDILG NGESLYFALN DSSELVGMNY IRCSHIRIDM DNENLSKISF
LVKPTGIFYP AHKIMEDEKQ LFGFKWRVNE KPILEEFLLR KQVAAQKAGN KSAQKNHINS
KKELN