Gene Aasi_1949 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1949 
Symbol 
ID6377543 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1794559 
End bp1796967 
Gene Length2409 bp 
Protein Length802 aa 
Translation table11 
GC content34% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003573278 
Protein GI294661402 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.618184 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTTAGCAA CAGGAGTATA TATCTATTAT TACCAAGAAC AAATCATACC AAAAGCTTTT 
CAAGCAGCTA TTTCTGCATT AAAACTGCCT ATTAATGCTA AGCATGTTGA AATAAGCTTA
CTTAAAGATT TTCCTAGGTT AAGTTTATTG CTCCACCAGG TAACCATAAA AGATTCGGAA
AATACAAACA ATATATTAGT AGCTGATCAA GTAGGCTGCG TATTTAACAT TATACATTTC
CTTAAAGGGA AATACATATT AGAACAGCTT TTTGTTAACC AAGGTACTTT ATCTATACCT
GTGCAAAAAC AAGTAGCACG TACTGTTAGA ACTGATAATG CAATCGAGGA TGCACTGCAA
GCTAATTTTA CTTTTCCTGT TCATCTCAAA AAGCTCGTAT TAAATGATGT AAAATGTATA
TATCCAGGAA AGCTATCGTC TGAGCAAGTA ATTATTCATA CTCAAAAAAT TTTTGCCAAG
TTTAATATGC AAAATCAACA CTTGTCTATG CATGTTAAGG GGCAAGCTGT AATTCAAGAA
ATTGCTTATA AGCCAATATC TTATGCCTCG ACTATTCCTA TATATATTAG TACAGATATA
GAGTATAATC TTTTAGAGCA AGTTTATAGC CTGCATAGAA GTACAATTAA GCAAAGTAAT
AATACTCTCT ATCTTCAAGG TACATGGTCT CCTAATATTG TTAAACCTTA TATAGACATA
CAACTTAAGG CACCCCGTTT AGCTATTTCT AAAATTTTAG CATTCATACC AGCAGTCTAC
CGTGAACAGG TACTTTCTTA TGGTCCAAAA GGGAAATTGG CTTGCAACCT GCAGTTTACA
CAGCGAGATA AAACAAGTAT AGCAGTTGAC TTTAAGCTGC ACGAAGGAAG TCTATTACCT
AAAAATCTTA AACAATCCAT TCAAGTAGAA AACGTTGTAG GTAAATTAAC TATACCTAAC
ATACAAGAAT TAAATACTAG TACTTTACAA GTAGATAATT ATGAAGTTAA GTTAGGAGCA
AGTCAGCTCA GCGGTAGTAT CCAGCTAAAT GATTTTAAAA AATTATATAT AAAAAACCAA
GCTAAAATTG TGTTGGATTT CCCTACACTG ATGCCTGCCT TATCATCCTC TTCTGATATG
CATCCAACAG GTCAGCTTAT AGGTCATTGG GACTTGAATA CCAGTCTGGA TAATATATTA
AAACATAATG CTATACATAA AAGTACCTCT TTCAAAGGAC AACTTAGTGC GCAAGGTGTA
CAATTTATAT ATAACCAAAC TCTGTTTCAG CTACAAGAAT GTGTTTCCTT ATTATTACAA
GATAATGTAT TACACATCAA AGATGTAGCC GGACATATAG AAGGAAAACC CTTTGTGCTG
TCAGGTACCA TTGATAATTG GAGCACCTTT TTAGGAGATA ACCAACACAA GTTGCATTTC
AATGCTAAGC TATATGCAGA TTATTTAGCC TTAGATAAAA TATTGCAAGT TAGTACAATG
CAACATAATC ATAAACCTAT GGACTGGTCA TGGCTAACCA CTTATCTAAG AGGAGAGCTT
GAGTGTGATA TAGAAGAAGT AATCTATAAG CGTTTCCGCG GTAATAAGAT ACGAGGCAGA
TTTAAAATAG TAGAGCAGCA ATTAATAGCG GATGCTATAG AATTAATTTT TGCAGGAGGA
AAAGCTCGCT TAGCAGGTAG TATTAGTACT AAAGCAGATA GTTTAGAAAT ATATACATAT
GCCAACCTGC AAAATGTGCA ATTACCTGCC TTATTTTATA CTTTTGAAAA TTTTAATCAG
CGCTTCTTAG AAGATAGGCA TTTAGGCGGA CATGTATGTT CTGATATTGA GCTTACTATA
CAAACAGACA AGCAACTACA TATTGATGTT AACTCGTTAA AGGCAGACAT AGCTGTTCAA
TTACACAATG GGCTTTTGCG AGATTTTGAA CCTATGCAGC GCTTATCTGC CTATGTACCA
GAAAAAGAAC TAAAGCTGCT ACGATTTTCG AGCTTAAAGA ATAATATACA TATTAAAGAC
CAAACTATCC ATATACCTCC CATGGAAGTA CATACTAGTC TTACAAGTAT ACAGCTAGCT
GGTACTCATA CATTTGATGG GAAAATAGCT TATAACTTAG TGGTACCCCT GAGAAATGCT
AATTCAGAAG AGATAAGAAG GCAAATGCCT GAAATTAATG AGGAGGCACT AGCAGGCCTT
AATTTATATT TAAAATTAGA AGGTACCACA CAGAACTACG CTTTACGCTA TGGTAATTCA
TTATTTAAAT TAAATATTAA AGAGAATCTT AAAAGGCAGG GTACTATTTT AGGAGATATT
TTACAAGGAA ACGCTGCTCC CAAACAGGCT AAGGAGTTAT CTACCGATGA ATACTTTGAC
TTTGGATAA
 
Protein sequence
MLATGVYIYY YQEQIIPKAF QAAISALKLP INAKHVEISL LKDFPRLSLL LHQVTIKDSE 
NTNNILVADQ VGCVFNIIHF LKGKYILEQL FVNQGTLSIP VQKQVARTVR TDNAIEDALQ
ANFTFPVHLK KLVLNDVKCI YPGKLSSEQV IIHTQKIFAK FNMQNQHLSM HVKGQAVIQE
IAYKPISYAS TIPIYISTDI EYNLLEQVYS LHRSTIKQSN NTLYLQGTWS PNIVKPYIDI
QLKAPRLAIS KILAFIPAVY REQVLSYGPK GKLACNLQFT QRDKTSIAVD FKLHEGSLLP
KNLKQSIQVE NVVGKLTIPN IQELNTSTLQ VDNYEVKLGA SQLSGSIQLN DFKKLYIKNQ
AKIVLDFPTL MPALSSSSDM HPTGQLIGHW DLNTSLDNIL KHNAIHKSTS FKGQLSAQGV
QFIYNQTLFQ LQECVSLLLQ DNVLHIKDVA GHIEGKPFVL SGTIDNWSTF LGDNQHKLHF
NAKLYADYLA LDKILQVSTM QHNHKPMDWS WLTTYLRGEL ECDIEEVIYK RFRGNKIRGR
FKIVEQQLIA DAIELIFAGG KARLAGSIST KADSLEIYTY ANLQNVQLPA LFYTFENFNQ
RFLEDRHLGG HVCSDIELTI QTDKQLHIDV NSLKADIAVQ LHNGLLRDFE PMQRLSAYVP
EKELKLLRFS SLKNNIHIKD QTIHIPPMEV HTSLTSIQLA GTHTFDGKIA YNLVVPLRNA
NSEEIRRQMP EINEEALAGL NLYLKLEGTT QNYALRYGNS LFKLNIKENL KRQGTILGDI
LQGNAAPKQA KELSTDEYFD FG