Gene Aasi_0468 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0468 
Symbol 
ID6377719 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp570287 
End bp571795 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content37% 
IMG OID642681628 
Producthypothetical protein 
Protein accessionYP_001957607 
Protein GI189501890 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTTTA AATCACTACT AAGTAAGCCA TTAGCCACGT GGGTAGTACG TAATCAAAAG 
CAATGCTTCA AAAATCCAGT ACGTATCCAG CAGAATATTT TTCATAAGCT CATTCAGCAA
GCTAAGCATA CGCTGTTTGG GCGTGCCCAT AACTTTAATT CTATTCGTAC GCATGAAGAT
TTTAAGCAAT ATGTTCCTAT TAGGGCCTAT GAGGATTTTA CAGGGTATAT AGAGCAAATT
AAAGGAGGGG AAAGCGACGT ATTATGGCCT GGAAGTCCTA TTTACTTTGC CAAAACGTCT
GGAACTACAG GTGGAGACAA GCATATACCC ATTACCAAAG AGTCTATCAA ACATCATATT
GTCAATGCTA GGAATGCCTT GCTATATTAT GTTAATGAGA CAAGCAAGAC TGACTTTTTG
AAAAGGAAAA TGATTTTCTT ATCTGGTAGC CCGCAGCTAA CGACCGAAGC AAATATCCTT
ACTGGTAGGC TATCGGGCAT TGTGAATCAT CATGTGCCTT CCTATCTACG TGGTAGTCAG
CTTCCTAGTT ATGCTACTAA CTGTATACCA GATTGGGAAA CTAAGTTGGA TAAAATTGTG
GAGGAAACGT TACAGGCTCA AATGGGGCTT ATATCTGGAA TACCACCTTG GGTACAAATG
TATTTTGATA AACTTACACA AGAAACAGGT AAGCATATTA GTGAAATATT TCCAGATTTT
TCCTTATTGG TACATGGTGG GGTAAATTTT GAACCCTATC GTCATAAGCT TTTTGACTCA
ATAGGTAAAG CAGTAGATAC TATAGAAACT TACCCTGCTT CTGAAGGGTT TATTGCTTTC
CAAGATTCCC AACAGGAAAA AGGACTCTTG TTACAATTAG ACAGTGGTAT GTTTTTTGAG
TTTATCCCTA CTATCAGCTT AGCTTCTCCA ACTCCCAAAC GTTTATCTAT AGAGGAAGTA
GAATTGGGTG TTGATTATGC TCTTGTCTTA TCTAGTAATG CAGGTTTATG GGCTTATATG
CTAGGAGATA CTATTAAATT CATTTCCTTA GAACCTCCTA GAATTGTGGT GACAGGGCGT
GTAAAACATT TTATATCTGC TTTTGGAGAG CATGTAATCA TAGAAGAGAT AGAGAAGGCT
ATGCAATTTA CACTAAATAA GTATCCACAA GTTAGGGTAA CAGAGTTTAC AGTAGCGCCC
TGGGTGAGTA AGCAAGCTGG TGAGGATTCT TATCATGAAT GGCTAATAGA ATTTAGTTAT
CCTCCACAAA ATATAACTAC CTTTGCTTCT GAACTTAACC GACAAATGTG TTTGTTGAAT
AGTTATTACA AAGACTTGAT AGAAGGCAAT ATTCTTAGTA CTTTAAAAGT AACTTCACTA
CAATCAGGAG CTTTTAAAGA ATATATGCGG CAAGTGGGCA AGCTAGGAGA ACAGAATAAA
ATAGTCCGCG TAGCAAATGA CAGAAAAATA GCAGATGCTG TTACTAAATA CAAAATATCT
GATTTGTAA
 
Protein sequence
MNFKSLLSKP LATWVVRNQK QCFKNPVRIQ QNIFHKLIQQ AKHTLFGRAH NFNSIRTHED 
FKQYVPIRAY EDFTGYIEQI KGGESDVLWP GSPIYFAKTS GTTGGDKHIP ITKESIKHHI
VNARNALLYY VNETSKTDFL KRKMIFLSGS PQLTTEANIL TGRLSGIVNH HVPSYLRGSQ
LPSYATNCIP DWETKLDKIV EETLQAQMGL ISGIPPWVQM YFDKLTQETG KHISEIFPDF
SLLVHGGVNF EPYRHKLFDS IGKAVDTIET YPASEGFIAF QDSQQEKGLL LQLDSGMFFE
FIPTISLASP TPKRLSIEEV ELGVDYALVL SSNAGLWAYM LGDTIKFISL EPPRIVVTGR
VKHFISAFGE HVIIEEIEKA MQFTLNKYPQ VRVTEFTVAP WVSKQAGEDS YHEWLIEFSY
PPQNITTFAS ELNRQMCLLN SYYKDLIEGN ILSTLKVTSL QSGAFKEYMR QVGKLGEQNK
IVRVANDRKI ADAVTKYKIS DL