Gene Aasi_1689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1689 
Symbol 
ID8999348 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp959082 
End bp960419 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content36% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003573111 
Protein GI294661235 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGGCAG GTGCTGATGG TCTGGGTGGT ATCCAATTTG AAAGTCCCTA TGTAGCTAAA 
GTGTTAGGTA CTATAACCCT TACACTTATT CTATTTTCTG GTGGCTTAGA TACTGATTTT
GGGCATATAC GACCTATTAT ATGGAATGGT ATCTTGCTAT CTTCAGCAGG CGTGCTAATT
ACAGCTTTAG GTATTGGGTA TTTTATTTAT TGGGTAACAG ATTTTACTCT AATAGAAGGG
ATGCTGGTGG GTGCTATTGT TTCTTCCACA GATGCTGCTG CTGTATTTTC TATTCTCCGC
TCCAAAAATT TACATTTAAA GTACAATTTA GGGCCCACAC TAGAATTAGA ATCTGGTAGT
AATGATGCTA TGGCCTATTT CTTAATGATA TTTTTTACCA AGCTGCTTAC CAGCAACCAA
GATATCAGTA TATGGTCTGC TATACCTCGA TTTTTTCAGG AAATGTGTAT AGGAGGTTTA
ATAGGAGTTA TAGTAGGGCA GGGTATGGTA TTTGTAGTAA ATAGAATTCA GTTGGCTTAT
GAATCATTAT ATCCAGGTAT TACTTTAGCC ATGATCTTGT TTGCTTACTC TGCCACTAAC
TTTATACATG GTAATGGCTT TTTAGCAGTT TATATAGCTG GAATTATTTT AGGTAGCCAA
AATTTCATCC ATAAAAATAG CTTGATACGT TTTTATGATG GTATATCTTG GCTGATGCAA
GTGGTGATGT TTATCAGCCT AGGACTTTTG GTTTCTCCCC ATAAGCTAAT ACCAATTGCA
GGTATCGGTT TGTTAATATC AGCTGCTTTG ATATTCTTGG CAAGGCCTAT TAGCGTATTT
ATTTCATTAG CATTTACTAA AGTAACCCTC AATCAAAAAG TATTTATTTC TTGGGTAGGG
CTTAGGGGTG CTGTTCCTAT TGTATTTGCT ACTCATCCTT TGCTAGAGTG TGTTGGCAAG
TCAGATAAAA TTTTTCATAT TGTATTTTTT ATTGTACTTA CTTCTGTATT GCTGCAGGGT
ACTACTTTAT ACTCATTGGC GAAGTGGTTA GGGCTAGAAG AAACAGCCCC TCATAAACAG
GCTCGCTCCA TCCATTTGGC CGATGATGTT AAGAGTGAAC TTATAGAATT AACTGTTCCA
GCTGGTTCTA GTATAGATGG TAGAAAAATT GTAGAAATAG ATTTTCCTAA AAATGCATTG
ATTGTATTGA TAAATAGGGA TAAGCGTTAT TTAACTCCAA GAGGTGACAC AGAGCTTAAG
ATAGGAGATA AGTTAATGGT TATGGCAGAG GATAAGAATG ATATCAATGA AGTAGAAGCT
TGCCTAGGCA TAGCTTAA
 
Protein sequence
MLAGADGLGG IQFESPYVAK VLGTITLTLI LFSGGLDTDF GHIRPIIWNG ILLSSAGVLI 
TALGIGYFIY WVTDFTLIEG MLVGAIVSST DAAAVFSILR SKNLHLKYNL GPTLELESGS
NDAMAYFLMI FFTKLLTSNQ DISIWSAIPR FFQEMCIGGL IGVIVGQGMV FVVNRIQLAY
ESLYPGITLA MILFAYSATN FIHGNGFLAV YIAGIILGSQ NFIHKNSLIR FYDGISWLMQ
VVMFISLGLL VSPHKLIPIA GIGLLISAAL IFLARPISVF ISLAFTKVTL NQKVFISWVG
LRGAVPIVFA THPLLECVGK SDKIFHIVFF IVLTSVLLQG TTLYSLAKWL GLEETAPHKQ
ARSIHLADDV KSELIELTVP AGSSIDGRKI VEIDFPKNAL IVLINRDKRY LTPRGDTELK
IGDKLMVMAE DKNDINEVEA CLGIA