Gene Aasi_0884 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0884 
Symbol 
ID6377054 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1119545 
End bp1120639 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content38% 
IMG OID642682020 
Producthypothetical protein 
Protein accessionYP_001957981 
Protein GI189502264 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.698912 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTTAA ACCAGAAAAT CATAAAACCT AAATTGGGTT TATTAGAGTT AGCAAAAAGG 
TTAGGCAATG TATCCTCAGC TTGTAAGACC ATGGGCTATA GCCGAGATAG CTATTATCGC
TTTAAGGAAC TCTATGAGAA GGGTGGTGAA GAAGGGCTTT ATGAATTAAC ACGTCGAAAG
CCAATCCTTG CCAATCGAGT AGACCCTACT ATAGAAAAGG CAGTACTGAA TATGGCTATA
GAATATCCAG CCTATGGACA AGAAAGAGTG TCCAATGAGC TAAAAAAGCA AGGTATACTT
GTATCTGCTG GAGGAGTAAG ATCTATATGG CTTCGCAATG ATCTAAACAA CCTTAAAAAG
CGATTAACAG CTTTAGAAGC TAAGATGGCT CAAGATGGTA TGGTTTTAAC AGAAGCACAG
CTAAAAGCTT TAGAGAATAA GAAGCAGCTT CAAGAAGCCC ATGGAGAGAT AGAGACGGCT
CATCCAGGCT ACTTAGGATG TCAGGATACT TATTATGTGG GTAATTTTAA AGGTATAGGT
AAAGTATATG GACAAACTTA TATTGACTCT TACACTAGAG TAGCAGAGGC TAAGCTATAT
ACAGAAAAAA CAGCTATAAC TTCCGCTCAT ATCTTAAATG AGCGGGTATT GCCTTGGTAT
GCTGAGCAAG GAATACCTGT TTTGCGTATT ATGACCGACA GAGGCACAGA ATATAAAGGA
ACCTTAGAGA ATCATGCTTA TGAGCTGTTC TTAAGTGTAG AGAGAATAGA ACATACTACT
ACTAAAGCCT ACTCACCACA GACTAATGGC ATGTGTGAAC GTTTTCATAA AACAATGAAG
ACTGAGTTTT ATGACACAGC TATGAGAAAA AAGATATATA CCAGCTTAGA AGAATTGCAG
CGTGATTTGG ATGAGTGGTT ATATTACTAC AACAATGAGC GAAGTCATAG TGGAAAGTAT
TGTTATGGGA AAACGCCTAT GCAAACTTTT AAAGACAGTA AGCACTTAGC CTTGGAAAAG
AACAACGAGT TGTTGTATCT TTCTAGCACA CCAGACAGGC TAGAGTATGC TGACAATTTG
CATCCCCTGT TGTAA
 
Protein sequence
MNLNQKIIKP KLGLLELAKR LGNVSSACKT MGYSRDSYYR FKELYEKGGE EGLYELTRRK 
PILANRVDPT IEKAVLNMAI EYPAYGQERV SNELKKQGIL VSAGGVRSIW LRNDLNNLKK
RLTALEAKMA QDGMVLTEAQ LKALENKKQL QEAHGEIETA HPGYLGCQDT YYVGNFKGIG
KVYGQTYIDS YTRVAEAKLY TEKTAITSAH ILNERVLPWY AEQGIPVLRI MTDRGTEYKG
TLENHAYELF LSVERIEHTT TKAYSPQTNG MCERFHKTMK TEFYDTAMRK KIYTSLEELQ
RDLDEWLYYY NNERSHSGKY CYGKTPMQTF KDSKHLALEK NNELLYLSST PDRLEYADNL
HPLL