Gene Aasi_0934 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0934 
Symbol 
ID6377037 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1198477 
End bp1199571 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content39% 
IMG OID642682065 
Producthypothetical protein 
Protein accessionYP_001958026 
Protein GI189502309 
COG category[L] Replication, recombination and repair 
COG ID[COG2801] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0345872 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTTAA ACCAGAAAAT CATAAAACCT AAATTGGGTT TATTAGAGTT AGCAAAAAGG 
TTAGGCAATG TATCCTCAGC TTGTAAGACC ATGGGCTATA GCCGAGATAG CTATTATCGC
TTTAAGGAAC TCTATGAGAA GGGTGGTGAA GAAGGGCTTT ATGAATTAAC ACGTCGAAAG
CCAATCCTTG CCAATCGAGT AGACCCTACT ATAGAAAAGG CAGTACTGAA TATGGCTATA
GAATATCCAG CCTATGGACA AGAAAGAGTG TCCAATGAGC TAAAAAAGCA AGGTATACTT
GTATCTGCTG GAGGAGTAAG ATCTATATGG CTTCGCAATG ATCTAAACAA CCTTAAAAAG
CGATTAACAG CTTTAGAAGC TAAGATGGCT CAAGATGGTA TGGTTTTAAC AGAAGCACAG
CTAAAAGCTT TAGAGAATAA GAAGCAGCTT CAAGAAGCCC ATGGAGAGAT AGAGACGGCT
CATCCAGGCT ACTTAGGATG TCAGGATACT TATTATGTGG GTAATTTTAA AGGTATAGGT
AAAGTATATG GACAAACTTA TATTGACTCT TACACTAGAG TAGCAGAGGC TAAGCTATAT
ACAGAAAAAA CAGCTATAAC TTCCGCTCAT ATCTTAAATG AGCGGGTATT GCCTTGGTAT
GCTGAGCAAG GAATACCTGT TTTGCGTATT ATGACCGACA GAGGCACAGA ATATAAAGGA
ACCTTAGAGA ATCATGCTTA TGAGCTGTTC TTAAGTGTAG AGGGAATAGA ACATACTACT
ACTAAAGCCT ACTCACCACA GACTAATGGC ATGTGTGAAC GTTTTCATAA AACAATGAAG
ACTGAGTTTT ATGACACAGC TATGAGAAAA AAGATATATA CCAGCTTAGA AGAATTGCAG
CGTGATTTGG ATGAGTGGTT ATATTACTAC AACAATGAGC GAAGTCATAG TGGAAAGTAT
TGTTATGGGA AAACGCCTAT GCAAACTTTT AAAGACAGTA AGCACTTAGC CTTGGAAAAG
AACAACGAGT TGTTGTATCT TTCTAGCACA CCAGACAGGC TAGAGTATGC TGACAATTTG
CATCCCCTGT TGTAA
 
Protein sequence
MNLNQKIIKP KLGLLELAKR LGNVSSACKT MGYSRDSYYR FKELYEKGGE EGLYELTRRK 
PILANRVDPT IEKAVLNMAI EYPAYGQERV SNELKKQGIL VSAGGVRSIW LRNDLNNLKK
RLTALEAKMA QDGMVLTEAQ LKALENKKQL QEAHGEIETA HPGYLGCQDT YYVGNFKGIG
KVYGQTYIDS YTRVAEAKLY TEKTAITSAH ILNERVLPWY AEQGIPVLRI MTDRGTEYKG
TLENHAYELF LSVEGIEHTT TKAYSPQTNG MCERFHKTMK TEFYDTAMRK KIYTSLEELQ
RDLDEWLYYY NNERSHSGKY CYGKTPMQTF KDSKHLALEK NNELLYLSST PDRLEYADNL
HPLL