Gene Aasi_0312 details


Gene Information

Locus tag: Aasi_0312
Symbol:
ID: 6377606
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 360384
End bp: 362342
Gene Length: 1959 bp
Protein Length: 652 aa
Translation table: 11
GC content: 38%
IMG OID: 642681491
Product: hypothetical protein
Protein accession: YP_001957476
Protein GI: 189501759
COG category:
COG ID:
TIGRFAM ID:
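
The reported lengths can be checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon; both assumptions match the figures above but are not stated in the record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(360384, 362342)
print(length)                  # 1959
print(protein_length(length))  # 652
```

Both values agree with the Gene Length and Protein Length fields.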


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACAGCA CAAATAAAAG AATTACAGCA GCTATTTTAC TTTGTAGTCA GCTATTAACC 
ATAACAAGTT GTAGTAATTA TCCTAATATA CCCACCCAAC CTAAAGTAGC TCGTACGGAA
AAGAGAATTG ATAAAAAACC AAGCAAAGAT GACTCAATAT TACCCTTGGT AGATAGCTTA
GTGATTACTA TTGCTGATGG GAAAAAACTT AATTTAAAGT ATGAGAACAA AAATTGGCAA
GCAGCGTTAA TTACAGAAGA TGGTGTGGAA CAGATATTGA CCATTGTATT AGAGCCAGGC
TATAGCATAG CTAAGCTAAT TACTGCTAAA GAAGAGGAAC AGCGAGAGCT AGTGCGCCTA
GTAGCCAGTA AAAAAGAGAC ATTAAATGCA GCATATGTAT ATATAGGCAA ACAAGCTTGT
ACAGAGCAGT CTATAACTAG CCAGGTGCAG ACAAAAAGCA GCACAAAATT ATCGGATGAT
GCATCAGCTT CTTCCAGAAT GCTAATTACT AGTACAGCAC ATACAAAAAT ACCTATCTCT
ACATTATCCA AGCATATAAA AGAAGAGCAA AAGCTAGCTT CCAAACAAGG TAATTCTATT
ATAGTAAACT CTATAGTGGA CAATACCAAG CAGGCCATAA AGAGAGCTTC ATCAGTATCT
GTATTACGTA ATCAAGTTTT TATCTTATCT ATATCTATTA AACCAACGTA TAAGGGGCAG
CAACCTAAGA ATAATAGTCC TTCCATTACT TTAATCAGAG GCAAAGCTTT GGAATTGCAC
AAAAAAGAGG CACAAGATAA GCGAGAACAA GCCAAGCTAT ATAATGAAGT AGATATAGCT
GCATCAATAA CTGACAGTAA TGGCATAGCT GCACAGTTAA TAGAAGAGCA GGCTATCCCA
AGCTATATAG CAAAAGGAGG CCATCAAGTA TATCCATCCT TTATAGAAGG CAAATGGATG
GCAGTTGTTA GGGAGCATGC TCCTTTAGGT TTTAGTAGAA CGCACTATTT GGAACTGTAC
CTAGCACCTG GCTTTACAGT CAATGAATTA AGCAAACATA GTTTGAAGTG GCAGGAAAAA
CATATAGCAG TAGTATTTGC TGAGCACAGC AAAAGTGGTA AAGGATATGT ATATATAGGT
GAGAAAGGAC TGTTGGGGGG AGGAAATAGT GGGTCTAAAG GAGGTGGTGG AAATGATAAC
GATCGCTCCT CTGGAGGTAG TAGTAGTTCA AGTGGAAGCA GCAGAAAAAA TGTGTCTAGC
GGAAGTAGAA AATCCTCTTC TAGTTCTAGT GGAAGTACAA GACATGATAA ACAGAGTAGC
CAAAGCACTA GCAGGTCAAG CACTTCCTCT TCAAGCACTT ATAGTAGTAG TTTTACTCCT
AGTACAGAGC AACGTGCTAC ATCGGCTATG TTATCTTCTA TAGGTATCAA ATCTGAACTA
CCTACTTATC ATTTTGATAA GTCATATACT AATGATTATT CTTATAGTAC TCCACATGTT
AGTTACCCCA ATCGAGATAC CTCATATTCG GGCAGTGTTA TGGACAGCAT TAGTAGCAGC
AGAATATCCA CGGTTAGTAC ACCAACCCCC ATAATTTCTT ATAGAACGTC TACTAGTATG
GGAGATATAC CAAGTAGTAG TCATACAAGA GAACCTTCTG CTAATAGAAC AACTCCTAGT
AGTAGTATAA CAGGAACTGC TGCTTACATG AAGGCACCAA GTGAGAAAGG GCTACCCCTT
ACGCTAAAGG ATACAGCTAA AGAGATAAAA AGCTATTTGC AAGAAGTTCA AGGTAGTGGG
CAAAAAGACA TTGATTCAAT AATTAAGCAG CGGGAAAAAG GAGAGCAATT GTTAAGACGG
CTACATACAC TTAAGCAACA ACAAGAGAGA GCGCATGCGT ATACAGAAAA AGCATATATT
ACAGCAGATA ATTCTGGGAT AGGCGATCAG AAACTCTAA
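
A GC content figure such as the one in the Gene Information section can be recomputed from the sequence. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the total sequence length; the record does not state how its value was calculated. The helper name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence, ignoring whitespace."""
    bases = "".join(seq.split()).upper()  # drop the spaces and line breaks used for display
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Only the first display line of the gene sequence is used here, so the result
# is for that 60 bp window and not for the whole gene.
first_line = "ATGTACAGCA CAAATAAAAG AATTACAGCA GCTATTTTAC TTTGTAGTCA GCTATTAACC"
print(round(gc_content(first_line), 1))
```

Passing the full 1959 bp sequence to the same function gives the whole-gene value to compare with the reported one.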
 
Protein sequence
MYSTNKRITA AILLCSQLLT ITSCSNYPNI PTQPKVARTE KRIDKKPSKD DSILPLVDSL 
VITIADGKKL NLKYENKNWQ AALITEDGVE QILTIVLEPG YSIAKLITAK EEEQRELVRL
VASKKETLNA AYVYIGKQAC TEQSITSQVQ TKSSTKLSDD ASASSRMLIT STAHTKIPIS
TLSKHIKEEQ KLASKQGNSI IVNSIVDNTK QAIKRASSVS VLRNQVFILS ISIKPTYKGQ
QPKNNSPSIT LIRGKALELH KKEAQDKREQ AKLYNEVDIA ASITDSNGIA AQLIEEQAIP
SYIAKGGHQV YPSFIEGKWM AVVREHAPLG FSRTHYLELY LAPGFTVNEL SKHSLKWQEK
HIAVVFAEHS KSGKGYVYIG EKGLLGGGNS GSKGGGGNDN DRSSGGSSSS SGSSRKNVSS
GSRKSSSSSS GSTRHDKQSS QSTSRSSTSS SSTYSSSFTP STEQRATSAM LSSIGIKSEL
PTYHFDKSYT NDYSYSTPHV SYPNRDTSYS GSVMDSISSS RISTVSTPTP IISYRTSTSM
GDIPSSSHTR EPSANRTTPS SSITGTAAYM KAPSEKGLPL TLKDTAKEIK SYLQEVQGSG
QKDIDSIIKQ REKGEQLLRR LHTLKQQQER AHAYTEKAYI TADNSGIGDQ KL
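
The protein sequence can be reproduced by translating the gene sequence. The sketch below is a plain-Python illustration, not the database's own method. Translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted), and this gene starts with ATG, so a standard codon table is sufficient here. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove display spacing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last display lines of the gene sequence (both start in frame).
print(translate("ATGTACAGCA CAAATAAAAG AATTACAGCA GCTATTTTAC TTTGTAGTCA GCTATTAACC"))
# MYSTNKRITAAILLCSQLLT
print(translate("ACAGCAGATA ATTCTGGGAT AGGCGATCAG AAACTCTAA"))
# TADNSGIGDQKL
```

The two outputs match the first 20 and last 12 residues of the protein sequence above.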