Gene Aasi_1480 details


Gene Information

Locus tag: Aasi_1480
Symbol:
ID: 6377686
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 122600
End bp: 125842
Gene Length: 3243 bp
Protein Length: 1080 aa
Translation table: 11
GC content: 33%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003572978
Protein GI: 294661103
COG category:
COG ID:
TIGRFAM ID:
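
The Gene Length and Protein Length fields can be reproduced from the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the last codon of the CDS is a stop codon; both are assumptions about this record's conventions, although they agree with the values listed. The helper names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(122600, 125842)
print(length_bp, protein_length(length_bp))  # prints: 3243 1080
```

With this record's coordinates the result is 3243 bp and 1080 aa, matching the fields in the record.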


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGATTAA TTGTAGGGTT TAATATAGCT GCTTGTAATT GCAGACAAGC GGAAGCGGGA 
ATAGATACAA ATCAGGCTGG TAGCTTAACA ATGACCATAA GCGATACCAA GCTAATGGGA
AGTTCACAAA CAACTGAAGT GTTCTTTAAC CTTTCAGATA ACACCAAAGA AGCACTATTA
GAAAACTTTA AATTGCAAAC TTCCGTTTTA AAAGAAAAAG GTGGCACAGG CAGCAAAATA
AAGTATAATA CCTATGTAGA TGGAAAGCCT GCCCCCAAAA AAGATACATT TGTAGACAAA
AAACTTATAC ACTTTACAAA AGATGAAAAA TTAGTTCCTG GAGATGCTCC CCTAAAAGTA
GGTTTTAAGA TAAAGCCAGG AGAGGGTGTT ACCAACTTAG TAGTTAAATT TACACTTTTA
GATGCAGCAA ATCAACCAAT CGGCAATAGT ATAAAAGTAA CCTGGAAAGA AGAAAAGCAA
CCTATAAAAC TAGCATTGGA AAGAATGAGT CCTGAGCATA TACAAGGTGC TAATAAGGCC
ATCCAGCTCA AAATTTCTAA TTTGGGAACA CAGCAACCAG AAGCCAATCA ACTTAAGTTA
AAAGCAATTA GAACAACAGG CACATCTGCT GTTATTAATG GCGCTATTCT TACAACCAAT
ACCAACGAAT ACGAGATAGC CTTAGGCAAG CTAATTGATA ATAAAAGTAC CATTCAAGGG
TTAAATATAA TTCCAAATGA AGATACAAAG GCCGAATATA CCTTACAGTT ACTATACAAA
GGTGAGCTCA TAGGCAATGA AATAAAGGCA AGTTGGGAAA AAGGTATTGA GCTAACTTTG
AGTGGTATAG AACATGACAA AGCTACCCAG TCAATTATCT ATACAGTGGC TAATACAGGT
TCAATAGTGG CAAGTAACTT AAAGATTCAG TATAGCAGTA AAACAGCAGG TATCAAGCTA
GATGGTAATA CTATAGGAAG CACACCAAGA ATAAAAAGTC TGTACAATTT AAACCCAGGT
GCCATATTAG AGAATCAAGT TCTAGGAAAA CTGGATTTTA ATGGTAATAA GAGTGCTGAT
TTTGAATTTA AAATAACCTG TGAAGAAGGC TTTACTATAA CACAGAGCCA TACCTTTTAT
GATGAAGATA TTGATTTATA TATTGACAAG TTAGTTTATG ACCAAGCCAA CGGATTAATT
ATTTATAATG TAAAAAATAA AGGCACTCGT AAAACCACTA ATAAAGTTAA ATTAAAATAT
AGTAATGTAA GTGAGGATGA TAATCTAGAC GGCCAAACTG CTTTGTTGGA AAACAGCCTT
TCAGCTACAA TAGATTTAGG AGAGATACCT GGAAACAATG GAGAAACAGG GGATAGGAAA
CTGGCTATAG ACTTCAAATA TGCTGACGAA TCATCTTTTA AATTTGAATT AATTTATAAC
GATAATATTA TCACACATGA AACAAAAGTA AAAAATTTCA AAGCAAAACC TGTGCAACTA
AGCATAGTTC CTTTAACACC TTTAAGGCTA ATGGGATCAC AACAAGAAAT AAAATGTAAA
ATAGAACTTG GGGCAGACAG CCGACCATTA GACCATATAG ACACCAGTAA ATTAAGCCTA
TCTATAACTA ATCTGTCAAG AAATAGCGCT TACCTTGCAT TAACTTCAGG AAAAGAATCT
ATAGATAAAT TGGCTGGTGA AGACCTTGGT GCTTTAGGTA ATGAAGTCAC GTTACATATT
AATTCAAATG GAAGTAGAGG AGCGCAATTT AATTTTGATA TTGAGTACAA AGGAAAATCC
CTCACTACTA ATCCTCTTAG CATAAGCTGG GAAGAAGCCA CCTTAGAAAT TATAGAACTA
AATGATTTTA TAAACAATGA TATAGCTACT TTTAAGTTAA AGAATTTAAA TCCGCTCGAC
TCTATAGATA CTGAAAGTAT TCATATAGAA CTATTGAGCA GTAATAACGC TGAGTTTACA
CTGCTAGATG CAGGTAATAC TACAATAGGA AAAACCCCTA ATTTACATCA ACTAGTGGCT
CATCTTAACT TATTACAGCC AGCTAAAGCA ACAGAGCCCA TTAGCTTTCA ACTAGCTAAT
ACCAATGGAG AAAAAGAAGC TACTATTACG CTTATTGTAA AACGGGGTAC GAATGAATTA
GCTAGAAAAG ATGTATTATG GAAAAAGGAA GAGAACCTCA TCAAACTTAA CTTTGAAAAA
CTTAGTTTCA CAAATGAAGA GGTTATTGGA ATTACATTGT TGAATGCAGG CCCTACATTA
GATGCAAATA CAACAAGAAT CCAGCTCACA AATGATAAAA ATATTCAATT TAAGTTAAAT
GGTAGTATAG AAAATAGGAT TGATACAACC TTAAAGGAGT TTATAGATGA AGATAACTTT
ATAAAAGGAG AAACTACAAA GATTTCTTTA CAAATAGCTC ATGTAGGCAA TGAGTATTCT
GCTACGTTTA CGCTAAAAAT ATTAGATAAC AATAATGAAA TAATAGGTGG CAATGAGTTA
ACGTGGACTA ACTTCCAAAA ACTAATAGAT ACAGAAGAAG ATCTAGATAG GATAGAAACA
ATTATTGTGG AAATTAGAAC GATTCAAGAA CATATTGAGA CTCTTGAAAA AGGAAAACAA
CTAAATTTAA AGATTTGGGA AGAAAATGCA GAGAAAATTA TAGATAAGCA GCTCGAGCTT
AGAAACACCC TCGAAAATAG ATTTGAGAAT ATTGGTAGTA TTGATAATCC AAGCAGAGAC
ACACAAACAG AGAAACTAAG AAATATTCAG GAATCAAATA AAGATATTGA AGAATATATT
TTAAATTCCT TAAACTATGC TAGTGCAAAT GTGGAGTATT TTGCTCAAAA AATTAAAGAT
ATAAAAGAAG GATCTAAAAC TAATGGCCTT ATGGGAGTAG TTTCATTAAA TGCAGATAGG
ACTTCTATAA AGGCTTGGGC TATGACACTT TTAAAACTAG AGGACAAATT TGGTGTTTCT
ACAAATACAG AAACAATAAC TCAAAAAGCC ATTAAAGCTT ATGAAAGTAT AAAAGAAGTT
ATGGACACGC CTATAAAAGT AAACAGAGAG TCAAATGAAA CTAATACAAA CATTTCCCAA
ATCGAATTAG ATAAGATTTT TGCTTCTCCT AAAATAGAAA CGCCTAGACG TATGCAGAAA
TTTTTTCATA AAAAAAATAA TTCTAATGAG AATCCGAAAG CAAGACGAAA ATTGGGAATT
TGA
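
The gene sequence is printed in blocks of ten bases, so the spacing has to be removed before the sequence can be analysed. The following is a minimal sketch of that cleanup plus a GC-fraction calculation of the kind that could produce a GC content figure. The helper names are hypothetical, and the code assumes the input contains only A, C, G and T.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# The first three blocks of the gene sequence, used here only as a small sample.
sample = clean_sequence("ATGGGATTAA TTGTAGGGTT TAATATAGCT\n")
print(len(sample), round(gc_fraction(sample), 2))
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the GC content field of the record.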
 
Protein sequence
MGLIVGFNIA ACNCRQAEAG IDTNQAGSLT MTISDTKLMG SSQTTEVFFN LSDNTKEALL 
ENFKLQTSVL KEKGGTGSKI KYNTYVDGKP APKKDTFVDK KLIHFTKDEK LVPGDAPLKV
GFKIKPGEGV TNLVVKFTLL DAANQPIGNS IKVTWKEEKQ PIKLALERMS PEHIQGANKA
IQLKISNLGT QQPEANQLKL KAIRTTGTSA VINGAILTTN TNEYEIALGK LIDNKSTIQG
LNIIPNEDTK AEYTLQLLYK GELIGNEIKA SWEKGIELTL SGIEHDKATQ SIIYTVANTG
SIVASNLKIQ YSSKTAGIKL DGNTIGSTPR IKSLYNLNPG AILENQVLGK LDFNGNKSAD
FEFKITCEEG FTITQSHTFY DEDIDLYIDK LVYDQANGLI IYNVKNKGTR KTTNKVKLKY
SNVSEDDNLD GQTALLENSL SATIDLGEIP GNNGETGDRK LAIDFKYADE SSFKFELIYN
DNIITHETKV KNFKAKPVQL SIVPLTPLRL MGSQQEIKCK IELGADSRPL DHIDTSKLSL
SITNLSRNSA YLALTSGKES IDKLAGEDLG ALGNEVTLHI NSNGSRGAQF NFDIEYKGKS
LTTNPLSISW EEATLEIIEL NDFINNDIAT FKLKNLNPLD SIDTESIHIE LLSSNNAEFT
LLDAGNTTIG KTPNLHQLVA HLNLLQPAKA TEPISFQLAN TNGEKEATIT LIVKRGTNEL
ARKDVLWKKE ENLIKLNFEK LSFTNEEVIG ITLLNAGPTL DANTTRIQLT NDKNIQFKLN
GSIENRIDTT LKEFIDEDNF IKGETTKISL QIAHVGNEYS ATFTLKILDN NNEIIGGNEL
TWTNFQKLID TEEDLDRIET IIVEIRTIQE HIETLEKGKQ LNLKIWEENA EKIIDKQLEL
RNTLENRFEN IGSIDNPSRD TQTEKLRNIQ ESNKDIEEYI LNSLNYASAN VEYFAQKIKD
IKEGSKTNGL MGVVSLNADR TSIKAWAMTL LKLEDKFGVS TNTETITQKA IKAYESIKEV
MDTPIKVNRE SNETNTNISQ IELDKIFASP KIETPRRMQK FFHKKNNSNE NPKARRKLGI
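
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration rather than the method used to build this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code, differing only in which codons may serve as start codons. It does not handle alternative start codons (which would be read as M at the first position), and the function name is hypothetical.

```python
from itertools import product

# Amino acids for the 64 codons in NCBI order (bases T, C, A, G at each position).
# Table 11 shares these codon-to-amino-acid assignments with the standard table.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a cleaned, in-frame DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, with the block spacing removed.
print(translate("ATGGGATTAATTGTAGGGTTTAATATAGCT"))  # prints: MGLIVGFNIA
```

The output matches the first ten residues of the protein sequence. The same function can be applied to the whole gene once the block spacing has been stripped.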