Gene Aasi_1502 details

Gene Information

Locus tag: Aasi_1502
Symbol:
ID: 6376588
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 235106
End bp: 236866
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 36%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003572994
Protein GI: 294661119
COG category:
COG ID:
TIGRFAM ID:
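
The coordinate and length fields in this record agree with each other, which a little arithmetic confirms. The sketch below assumes 1-based inclusive coordinates and that the gene length includes the terminal stop codon. The function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(235106, 236866)
print(length, protein_length_aa(length))  # 1761 586
```

Both values match the Gene Length and Protein Length fields of the record.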


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAATC TTATTCGAAT TGCAGCACTT ATTTTAATGC TTCTTGCGCT ATGGGCTTAT 
TTGAAAAGCA AGAAAGACTC CTCACCTAAA GTAACAGAAA AAGTATTGTA TACAGCTAAT
GAGGCACAGA TTAAAACATT AGATCCTGCC CAAGCTGAAG ACCATTATTC TAATAGAGAA
GTAGCTAAAG TTTATGAAGG TCTGTTAGAA TTTCATTACC TTAAAAAGCC ATTCGAGCTA
ACTCCTAATC TAGCGGAAGA AATGCCTGAA GTGTCAGCAG ACCAGCTGGT TTATACCTTT
AAAATTAGAC GAGGTGTAAA GTTTCATGAC AACCCTTGCT TTCCTAATGG TAAAGGGAGA
GAACTGACGG CGCATGATTT TGTATATTCT TTTAAAAGGT TAGCTGATCC TAAGCTTCAA
GCAAAGAACT TTTGGCTAAT CAATAACAAT CTAAAGGAAG TTAATGCATG GAGAGAAAGA
TATGCCGATG CTATACAGGC CAATTATGAC GAAGAAATAG AGGGAGTAAA AGCTATAGAC
CGCTATACAC TGCAGTTCAC TTTAACAAGA CCTAATCCAC AATTTCTATA CTTTTTAGGT
ATGTCGGGAT GTTACGTGGT TCCTCGTGAA GCAGTAGAGC ATTATGGTAT GGAGTTTACT
AATCATCCTG TAGGAACAGG AGCTTTTATG TTAGAAGCTT TTAATCCACA AGATAGTAAG
CTAGTATACC GCAAAAACCC TACTTTTAGA GATAAACGTT TCCCTAGTGA ATCTATAGAA
GAATATAAAC ATATGCTAGC TTATGCTGGG AAGCAGTTGC CTTTTGTAGA CAAAATAGTT
ACTTATATCC TTACTGAGGC ACAACCTAAA TGGCTTAAAT TTAAAAAGGG TGATTTAGAT
ATAATTGATA TTACTAAAGA TAAAATTGCC TTAGATGTAG TGCGAAACGG TGAGTTAATT
CCTGATCTTA AAGAAAAAGG CATTAACCTA TATAGCGTAG CTGAATTAAG TACTACTTAT
GTTGTTATGA ACTGTGCTAA TCCTTTATTT AAAGATAATC TTAAGCTTCG GCAAGCTATG
GCATTAGCAT TTGACAAAGA AGGTTATAAT AAATTGTTTC ATAATAATAC AGCAGTAGTA
GCACAATCAA CTGTTCCTCC TGGGCTAGCT GGCTACAGAG AAGATTATAT AAATCCTTAT
GGTATCTATG ATATTGAAAA AGCTAAACAA TATTTAGCAG AGGCAGGTTA TCCTGAAGGC
AAAGGATTGC CTGAGCTTAC ACTAGATGCG GGGCCTGATG CCGAACTAAG ATTAAAAGGA
GAATTTTTTC AGAAATGCAT GGCTAAAATA GGCGTACGTA TTAAAGTAGT CGGAAATATT
TTTCCAGAAT TAATAAAAAA AATTAATAAT CAAGCTACCA TGCTACATAG TATTTCTTGG
AGTGCAGATT ACCCAGACGC ACAAAATTTC TTCATGCTCC TGTATGGTCC TTACCAACCA
GGTGGCATTG GATCTAATTT AAACGACTCT GCTTATGATG CTTTATATGA AAAGGCTGTA
GCTATGCTAG ATTCTCCTGA AAGAACTAGG CTTTATGAAC AGCTTAATGA AATGATAGCT
GAAAAGACAC CTTTTATTTG TACTGTACAT TTTCCCCATA CTGGTTTGCA GCATGGGTGG
GTCAAGAACT ACTGTTGGTC TAACTTCCAT TATGGCACAG AACAATATTT TGATATAGAT
TTAGAACAAA AAAACAATTA G
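
The GC content field can be recomputed from a sequence printed in this space-separated block layout. The helper below is a minimal sketch and `gc_percent` is a hypothetical name. It is shown on the first three blocks of the gene only, so its result is not the whole-gene figure reported in the record.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(cleaned.count(base) for base in "GC")
    return 100.0 * gc / len(cleaned)


# First three 10-base blocks of the gene sequence above (30 bases, 9 of them G or C).
first_blocks = "ATGAAAAATC TTATTCGAAT TGCAGCACTT"
print(round(gc_percent(first_blocks), 1))  # 30.0
```

Passing the full 1761-base sequence to the same function gives the whole-gene value to compare against the GC content field.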
 
Protein sequence
MKNLIRIAAL ILMLLALWAY LKSKKDSSPK VTEKVLYTAN EAQIKTLDPA QAEDHYSNRE 
VAKVYEGLLE FHYLKKPFEL TPNLAEEMPE VSADQLVYTF KIRRGVKFHD NPCFPNGKGR
ELTAHDFVYS FKRLADPKLQ AKNFWLINNN LKEVNAWRER YADAIQANYD EEIEGVKAID
RYTLQFTLTR PNPQFLYFLG MSGCYVVPRE AVEHYGMEFT NHPVGTGAFM LEAFNPQDSK
LVYRKNPTFR DKRFPSESIE EYKHMLAYAG KQLPFVDKIV TYILTEAQPK WLKFKKGDLD
IIDITKDKIA LDVVRNGELI PDLKEKGINL YSVAELSTTY VVMNCANPLF KDNLKLRQAM
ALAFDKEGYN KLFHNNTAVV AQSTVPPGLA GYREDYINPY GIYDIEKAKQ YLAEAGYPEG
KGLPELTLDA GPDAELRLKG EFFQKCMAKI GVRIKVVGNI FPELIKKINN QATMLHSISW
SADYPDAQNF FMLLYGPYQP GGIGSNLNDS AYDALYEKAV AMLDSPERTR LYEQLNEMIA
EKTPFICTVH FPHTGLQHGW VKNYCWSNFH YGTEQYFDID LEQKNN
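
The protein sequence follows from the gene sequence under translation table 11, whose amino-acid assignments are the same as the standard code (the tables differ in which start codons are permitted). The sketch below builds the codon table from the usual TCAG-ordered assignment string and translates up to the first stop codon. `translate` is a hypothetical helper, and treating unknown codons as `X` is an assumption of this sketch.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE.get(cleaned[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and final 21 bases of the gene sequence above.
print(translate("ATGAAAAATC TTATTCGAAT TGCAGCACTT"))  # MKNLIRIAAL
print(translate("TTAGAACAAA AAAACAATTA G"))           # LEQKNN
```

The two outputs match the first ten and last six residues of the protein sequence. Each full sequence line holds 60 bases, so every line starts in frame.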