Gene Aasi_1784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1784 
Symbol 
ID6376919 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1307690 
End bp1309225 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content34% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003573178 
Protein GI294661302 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATTCT CAAAAAAGGA AAATTATTTG GTTATTATGC TAGCCATCTG TTGCTTGCAA 
ATACTAGTTT CTTGTGGCTG CGGCAACAAT CCTACGTCTC TTATCACCAA AAAAAATCAT
ATTCCAAAAA AAGTAAAGCC CATACCTCCT GTTCATCTTT TACTAAGTAG TAATAAACAA
ATATTAAATA ATACTGACAA GAGTTTTAAC CTATCTTTAG AAAATACTTC AGCAACCATA
GCTAATTTAA GCGATGGTAT ATTAAAAATA ACCTTACACG AGGAGGGTGG CTCAGGTAGT
ACGTTACGCT ATGCAACTAA TACTAATATA TATGAGCATC AGAAAGCTGT TGAAAAGCCT
TTGTCTTATT TTACTCAACA AGTAACTCTT AAAAAAGGTG ATGCTCCCTT GGTCATACCT
TTTAAACTAC ACACTCTACC AACAGTTACA AGTGTTAAAA TAACAGTGAA GCTTGAATAT
AAAGGTAAAA AGGATATTGT ACCCCCTCTA ACTATTGTAT GGGATGCAAT ATCACCCATT
ACAGAGGATA TGATTCAAAG TGTTGTACAT AATGGTTATA AACTTTTGGC TGACATACTT
ACCAAACTTC AAAAAGGAGA AGAGATAGCT ATTAATGATG TTACCGCAGT TTATCCAAAA
GAAACGGCTT TACACCAAGC TGTAAAATTG GGTGACGAAT ATATTGTTGA ACTCTTATTA
GAGAAAGGCG CAAGTATAAA TATACAAAAT ATAGAAGGAG AAACTGTCTT GCATTTGGCT
ACTAATTCGA ATAATACAGA CTTAGCCAAA AAAATAATAG GTAAAGGGGC AAAACTAGAG
GTGCAGAATA AGAGAGGTTA TACGCCTTTG CATTTAGCAG CCGAACAAGG TTATATAGAT
GTTGCTAAAG AATTAATACC ACATTTAAAT AGCGAACAAT TAAATCTCGC AAACATAGAA
GGGCAGACTC CATTACATTT AGCTGCTTCG TGGGGTCATA GTAAAGTTGT ATCATTATTA
ATACCTTATT TGGACACATG GGAACTCAAC CAGAAAGATC TTCAAGGTAA TTCTGCACTA
TATAAAGCTA GCCAATATGG ACATATAGAA ACAGTAAAGA GACTACTAGA TGCTGGCGCT
AAAATAGATG AAGCCAATGG TCTTGGTTTT ACTCCGTTAC ATATTTCTAT TATTGAGGGG
ACGTCTGCTG TGGCACGTGA ATTGACAAAT AGATTATCTA CAGAACAATT GAATCAACCA
GATATAAACG AGTATACACC ACTATACCTT GCTATATTAC ACAGCCATAC AGAAATAGCT
GAAGAATTAA TAAAAAAATT GGAGCCTGCA CAGTTAAATA AACAAAATGA TCAAGAGAAT
ACCCCCTTAC ATAAAGCTGT TGAGAAGGGC AATATAAAAA TAGCTAAACA GCTTATTGCT
AAAGGTGCAG ACATAACTAT AAAGAATAAA AAGGACCAGT CTCCAATGGA TCTAGCTAAA
TTAGATGAGA TGAGAAGGAT ATTGCAACTT ATGTAA
 
Protein sequence
MQFSKKENYL VIMLAICCLQ ILVSCGCGNN PTSLITKKNH IPKKVKPIPP VHLLLSSNKQ 
ILNNTDKSFN LSLENTSATI ANLSDGILKI TLHEEGGSGS TLRYATNTNI YEHQKAVEKP
LSYFTQQVTL KKGDAPLVIP FKLHTLPTVT SVKITVKLEY KGKKDIVPPL TIVWDAISPI
TEDMIQSVVH NGYKLLADIL TKLQKGEEIA INDVTAVYPK ETALHQAVKL GDEYIVELLL
EKGASINIQN IEGETVLHLA TNSNNTDLAK KIIGKGAKLE VQNKRGYTPL HLAAEQGYID
VAKELIPHLN SEQLNLANIE GQTPLHLAAS WGHSKVVSLL IPYLDTWELN QKDLQGNSAL
YKASQYGHIE TVKRLLDAGA KIDEANGLGF TPLHISIIEG TSAVARELTN RLSTEQLNQP
DINEYTPLYL AILHSHTEIA EELIKKLEPA QLNKQNDQEN TPLHKAVEKG NIKIAKQLIA
KGADITIKNK KDQSPMDLAK LDEMRRILQL M