Gene Aasi_1094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1094 
Symbol 
ID6377696 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1408265 
End bp1409323 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content33% 
IMG OID642682206 
Producthypothetical protein 
Protein accessionYP_001958166 
Protein GI189502449 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.38305 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCCGA AAAGTTTGTT CACAATAAAT AAATGGTTTC AAATTATTGT AAGCTTATTG 
TTAGTTATGC TTACCTACGG CTTTATATGG AGTTATCGGA TTATACAAAA ACCTAATATT
CTAGTAGGGC AACCATCTAG ACTACTCTTT ATACCACCCA ATACTACTTT TAATACGCTT
CAAAATACGT TATATAAAAA TGGATATATA ACTGACAGCA CATCTTTCAG ACTGACGGCT
CATCTTTTAA GATATGATCA CAAAATTTTA CCTGGAGCTT ATAGACTTTC TTCTGGAATG
AGCAACTGGA AAGCAATCCA ACTTTTGAGA GCTGGCATAC AAGAACCAGT TAACATTATT
TTGAATAATA TAGCTAATAA AGAAGAGCTA GCTACTAAAA TCACGCAAAA CATTGAAATA
GATGCTATAA CATTCCAAAA ACTATTAGAT GATTCAAAGT TTTTGCAAGC TTATGGATTT
ACACCAGAAA ATATCTTAAC AATGTTTATA CCCAATACTT ATAATGCATA TTGGACTATT
TCTACTGAAA AGCTATTTAA GAGAATGTAT GCTGAGTATC AGAAATTCTG GAAAGGTGAG
CGTTTGGAAA AAGCTAAAAA TTTAAATTTG ACACCTATAC AAGTATCTAT TCTAGCTTCT
ATTATAGAGA AAGAAACCAA CAAACTAGAA GAAGCGCCCC TGATAGCAGG TGTGTATATC
AATCGTTTAA GAAGAGGCAT GAAACTTCAG GCTTGCCCAA CTCTATTATA TATTGCTAAT
GACCCTTCAG CAACACGTGT GCTACATGCC TATATACATA TCAATTCTCC TTATAACACT
TACCTTTATA AGGGTCTTCC ACCTGGCCCT ATTACCATGC CATCTATTGC TATGATAGAT
GCTGTACTCA ATTATCGACA TCACGATTAC TTATATTTCG TAACCAAAGA AGATTTTTCT
GGTTATCACT ACTTTGCTAA AACTTTTAAA GAGCATAAGG AAAATGCAAA GAAATATAGA
AGAACGCTTA AAGAAATATT AGCTGCCAAC AAAGAGTAA
 
Protein sequence
MQPKSLFTIN KWFQIIVSLL LVMLTYGFIW SYRIIQKPNI LVGQPSRLLF IPPNTTFNTL 
QNTLYKNGYI TDSTSFRLTA HLLRYDHKIL PGAYRLSSGM SNWKAIQLLR AGIQEPVNII
LNNIANKEEL ATKITQNIEI DAITFQKLLD DSKFLQAYGF TPENILTMFI PNTYNAYWTI
STEKLFKRMY AEYQKFWKGE RLEKAKNLNL TPIQVSILAS IIEKETNKLE EAPLIAGVYI
NRLRRGMKLQ ACPTLLYIAN DPSATRVLHA YIHINSPYNT YLYKGLPPGP ITMPSIAMID
AVLNYRHHDY LYFVTKEDFS GYHYFAKTFK EHKENAKKYR RTLKEILAAN KE