Gene Aasi_0120 details


Gene Information

Locus tag: Aasi_0120
Symbol:
ID: 6376550
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 139579
End bp: 141204
Gene Length: 1626 bp
Protein Length: 541 aa
Translation table: 11
GC content: 33%
IMG OID: 642681311
Product: hypothetical protein
Protein accession: YP_001957296
Protein GI: 189501579
COG category:
COG ID:
TIGRFAM ID:
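
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that does not appear in the protein; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 139579
end_bp = 141204
reported_gene_length = 1626      # bp
reported_protein_length = 541    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codons - 1

print("gene length matches:", gene_length == reported_gene_length)
print("whole number of codons:", remainder == 0)
print("protein length matches:", protein_length == reported_protein_length)
```

Under these assumptions the three reported values agree with each other: 1626 bp is 542 codons, of which 541 encode amino acids.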


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGATTAA TTTTTAGTAA ACTATATACA ATTTCCTCAT CAAACATTAC TATAACACTT 
GGTATAGTCT TTTTATTCTT CATCAATGCT TGTGATTGTT CAAACCCTAA CCAGGGGCTC
CCTACCTTAA TTAATGGCCC AAAGGATCAT AAGAAACCGA AGCAAGAAGC TGTTGTAATG
AATGTTATAC CTAATAAGTT AAAGGCCGGT GAGAGAGAAG TAAAAATTAA TTTTACACTT
TCAGATGGTT TTACGGAAGC TAGGCTCAAA AAATGTAGAT TAAAAATTAC TTATTATACA
GCAACAGGTA TCAATAAAGA TAGTTACATA ACCTATAGAA ATGCTATGTC AATGGAAGTT
CGAAAAGCAA GCCTAGATCA GGAGTTGAGC GAATTTTATC TTACATCAGT AGAACAACAT
AAATCTTTTA GCCTACCAAT TATGCTTGTG CCAAATTTTA CGCCTGGCAT GGATGTCCTT
GACCTAAAAG TAAATTTTGA GCTTCTTGAT GAAAAGCGAA AACCGTTACA AAAGGATCAA
GTAAGCTGGG AATCGCAAGC AGAACCTCCT CATAAATTAA AGTTAGAACC CTACAAAGAA
AATAAGCTAT TAGAACTAAA AGAATATAAA ATATTAGAAC TGGAAGAAAT AGAAATACAT
GGAGAAAATA GAGAGTTTAC TGTGCAGGTG AGTAATTTGG GAAGCAATAT TACGGAATCT
GATCAGTTAA AGTTAGCTAT AAGCAGGGTA GAAGGTAATC ATGCTAGCCT AAGTATAGAT
GAGGAAAATA GCCAAGATCA AGAGCTAGAT TTAGGGACAA TAGCAGACAA TACACATATT
TCTAAAAGAA TTACCATTTC TCCTGGGCAG GATGAAAAAG CTAAATTTTT GTTACAATTA
TTATATAAAG GAAAAGAATA CGATTTTTTA TATATAGAGT GGAAAAAGGT ATCTCCTCAT
ATTCGAGCTG AATATTATAG GAGAGATAAT CATATAGGAT ATTTTATTGA CAACTGTAGT
TTATTACCAA AAAAGGTTCT AAAAGTATCT TATAAAAATA TAAGTAACAA TCCAGTTACA
CTGGGTGGGG TTACAGAGAA ATTTATCTCA TTAGAAAACT TGAGGACTTT TGATCATGCT
AGCTTGCCTA TAAAATTTAA TAATCAGCCA AGTGCAGAAT TTGAGTTTGA GTTGTTATAC
ATGGATTCTG TACTATCAAC AGCGTCTATT GTAGTAGAAA ATCTTCAGCT AAAGATTATA
GATCCACGGG ATGGTCAGAT GATATATGGT AGTAATCAGG CAACATTTTC TATTAAGAAT
TTAAGCGGAG CACGTGTCAA TATAAAAAAA GTGTACATTC AATGTGCAAG TGAAAGGAAA
AATGCTGCAA CTTTTATATT TGCAAATCCA GCTAATGGTG AGATAATTGA TGCAGAAACC
CCAATTAGCT TGTCAAAGTA TATCCATAAA GAAACTTTAG AATCTGAAGA AAAAGTAGAA
CTTCTCATAC AACTTAAAGA CACTCATTCT CAGATCGGTT CATCTGTTAA CTTGCAAATC
CAAGAGCATT ATAATGAGAA GGTTACATTT TTGGATGAAA AGACCTTAAA TTGGGTGCAA
AATTAA
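
The gene sequence above is laid out in blocks of 10 bases, 60 per line, so the spaces and line breaks have to be removed before it can be analysed. The sketch below shows one way to do that and to compute a GC percentage like the one reported in the Gene Information section. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, pasted as displayed.
first_line = "ATGAGATTAA TTTTTAGTAA ACTATATACA ATTTCCTCAT CAAACATTAC TATAACACTT"
seq = clean_sequence(first_line)

print(len(seq), "bases")
print(f"GC content of this line: {gc_percent(seq):.1f}%")
```

Passing the full 1626 bp sequence to `gc_percent` gives a value that can be compared with the reported GC content.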
 
Protein sequence
MRLIFSKLYT ISSSNITITL GIVFLFFINA CDCSNPNQGL PTLINGPKDH KKPKQEAVVM 
NVIPNKLKAG EREVKINFTL SDGFTEARLK KCRLKITYYT ATGINKDSYI TYRNAMSMEV
RKASLDQELS EFYLTSVEQH KSFSLPIMLV PNFTPGMDVL DLKVNFELLD EKRKPLQKDQ
VSWESQAEPP HKLKLEPYKE NKLLELKEYK ILELEEIEIH GENREFTVQV SNLGSNITES
DQLKLAISRV EGNHASLSID EENSQDQELD LGTIADNTHI SKRITISPGQ DEKAKFLLQL
LYKGKEYDFL YIEWKKVSPH IRAEYYRRDN HIGYFIDNCS LLPKKVLKVS YKNISNNPVT
LGGVTEKFIS LENLRTFDHA SLPIKFNNQP SAEFEFELLY MDSVLSTASI VVENLQLKII
DPRDGQMIYG SNQATFSIKN LSGARVNIKK VYIQCASERK NAATFIFANP ANGEIIDAET
PISLSKYIHK ETLESEEKVE LLIQLKDTHS QIGSSVNLQI QEHYNEKVTF LDEKTLNWVQ
N
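
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the tool used to generate this record. It relies on translation table 11 assigning the same amino acids to codons as the standard genetic code (the tables differ in which start codons are allowed), and it ignores alternative start codons, which does not matter here because the gene begins with ATG. The function name `translate` is hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
# The amino acid assignments are those of the standard code, which
# translation table 11 shares; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the display spacing
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases and last 6 bases of the gene sequence above.
head = "ATGAGATTAA TTTTTAGTAA ACTATATACA ATTTCCTCAT CAAACATTAC TATAACACTT"
tail = "AATTAA"

print(translate(head))  # compare with the first 20 residues of the protein
print(translate(tail))  # the last residue, followed by the stop codon
```

The first call yields the residues that open the protein sequence above (MRLIFSKLYT ISSSNITITL), and the second yields the final N before the TAA stop codon.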