Gene Aasi_0475 details


Gene Information

Locus tag: Aasi_0475
Symbol:
ID: 6377145
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 585003
End bp: 586028
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 35%
IMG OID: 642681635
Product: hypothetical protein
Protein accession: YP_001957614
Protein GI: 189501897
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0016] Phenylalanyl-tRNA synthetase alpha subunit
TIGRFAM ID: [TIGR00468] phenylalanyl-tRNA synthetase, alpha subunit
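
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes the coordinates are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The record states neither of these. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 585003
end_bp = 586028
gene_length_bp = 1026
protein_length_aa = 341

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: one terminal stop codon, not counted in the protein length.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # span matches the gene length
print(gene_length_bp % 3 == 0)                   # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop matches protein length
```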


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTACAAC AACTTGATTT AATTCAACAC GAAATAGAGC AGTATCATCC GACTACACTT 
CAGGAATTAG AAGAATTTCG CATAAAATTT CTAAGTAAGA AGGGAACTAT TACTCAGCTT
TTTACTGAGT TCGGCCAATT ATCTCCTGCT GATAAACAAG CTTTAGGTAG TAAGCTAAAT
GCATTAAAAC AAATTGCACA AGAAAAATAC AAAATATATG CATCACAGCT TCAATCAACA
CCTAAGCAAA ATACAGATGT AAATGAGGAC TATACCTTAC CTCCACCCGC TGATAAACTT
GGATCTAGGC ATCCGATAAG TATTTTAAAA GATAGAATTT TAGAAATATT TGAAAAAATT
GGATTTAGTA TAGTAGAGGG ACCAGAAATA GAAGATGATT GGCATAATTT CGGAGCGCTT
AACTTCTCAC CTAACCACCC TGCCAGGGAT ATGTTAGATA CTTTTTTCAT TTCTCAGTGT
CCTGATATAT TATTAAGAAC ACATACCTCT TCTGTGCAGA TACGGGTGGC TGAGAGTCAA
GCACCACCTA TACGCGCTAT TTCAATAGGA CGAGTTTATC GTAATGAGAC TATTTCTGCA
CGTTCGCATT GCATGTTCCA TCAGGTAGAT GCGTTCTATG TTAATAAGAA TGTAAGCTTT
GTAGAACTTA AACAAGTGCT TTTATATTTT TTACGTAGCT TATTTGGAGA AGATATAAAA
ATGAGAATCA GACCCTCTTA TTTTCCTTTT ACAGAGCCAA GCGTAGAAAT AGATATTAAT
TGCAGGATAT GTAATGGAAA TGGCTGCAAT ATATGTAAAC ATTCAGGATG GCTTGAAATT
ATGGGTGCAG GCATGATCGA TCCAAATGTA CTTAAAAACT GCCATATTGA TCCTACTACT
TATACAGGCT ATGCCTTTGG TATGGGCTTA GAACGTATTG CCATGCTGAT GTACCAAATT
AACGATTTAC GTTTATTTAC AGAAAATGAT GTTCGTTTTT TAAAACAATT CAAGGCTTAT
GCCTAA
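
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, not the method used to produce the value in this record. The function name `gc_content` is hypothetical. It strips the 10-base grouping and line breaks before counting.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    # Remove spaces and newlines used for display grouping.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First line of the gene sequence above, kept in its 10-base grouping.
first_line = "ATGTTACAAC AACTTGATTT AATTCAACAC GAAATAGAGC AGTATCATCC GACTACACTT"
print(f"{gc_content(first_line):.0%}")
```

Passing the full 1026 bp sequence instead of the first line gives a figure that can be compared with the GC content field.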
 
Protein sequence
MLQQLDLIQH EIEQYHPTTL QELEEFRIKF LSKKGTITQL FTEFGQLSPA DKQALGSKLN 
ALKQIAQEKY KIYASQLQST PKQNTDVNED YTLPPPADKL GSRHPISILK DRILEIFEKI
GFSIVEGPEI EDDWHNFGAL NFSPNHPARD MLDTFFISQC PDILLRTHTS SVQIRVAESQ
APPIRAISIG RVYRNETISA RSHCMFHQVD AFYVNKNVSF VELKQVLLYF LRSLFGEDIK
MRIRPSYFPF TEPSVEIDIN CRICNGNGCN ICKHSGWLEI MGAGMIDPNV LKNCHIDPTT
YTGYAFGMGL ERIAMLMYQI NDLRLFTEND VRFLKQFKAY A
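
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a simplified illustration. It uses the standard amino acid assignments, which translation table 11 shares for all 64 codons. It does not model the alternative start codons that table 11 permits, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in TCAG order (third base varies fastest), with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    # Remove spaces and newlines used for display grouping.
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        # Raises KeyError on ambiguous bases such as N.
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGTTACAAC AACTTGATTT AATTCAACAC"))  # MLQQLDLIQH
```

The output matches the first ten residues of the protein sequence above. The final two codons of the gene, GCC and TAA, translate to A followed by a stop, consistent with the protein ending in A.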