Gene Aasi_1912 details


Gene Information

Locus tag: Aasi_1912
Symbol:
ID: 6377640
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 1698023
End bp: 1700542
Gene Length: 2520 bp
Protein Length: 839 aa
Translation table: 11
GC content: 32%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003573255
Protein GI: 294661379
COG category:
COG ID:
TIGRFAM ID:
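
The coordinate and length fields above can be cross-checked against each other. The sketch below is only a consistency check, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 1698023
end_bp = 1700542
gene_length_bp = 2520
protein_length_aa = 839

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # coordinates agree with Gene Length
print(remainder == 0)                                # length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # agrees with Protein Length
```

Under these assumptions all three checks print True: the span is 2520 bp, which is 840 codons, leaving 839 residues once the stop codon is dropped.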


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.790758
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACATT CTTATACTCT AAATTGGCAA TTTATAGTCC ATATTTTATT TATAAGCTTA 
TGCGTACAGA GCTGCAGTGG TTTAAATAAT CTACCAGTAG GAATTCCAGG GCCAACCAAC
CACAAACAAA GATTAAATGA ACATAATATT CATCCTTTAC TTGTCCAAAC ACCAATAGAT
CAGAGAGGGC ATGTGGTCAC TTTTTACCAA GAAGGTAGTC AGTTACAAGC CGAGGTTGAG
GAAAAAAACG GATGCTTTAG TAAAATCCAT ACCTTACCTG TGTATATAGA ACAAGGTATA
AATTTACAGG AAGTAAACAG TAAACAACAT ATACATGTGA TTCTACCGAA AGGCCAAGAA
ACAGGATATG TATATGTGGG GCATACAGGT TTAATGGGAG GAGGGGAGAG TAAAGATGAA
GGGGAAGAGG AAGATAACGA GAATCAAAAG AATGAAAAAG CTAAAAGTCA ACAAAAAAAT
AAAAGTAAAG GAAAAGAAAA ACTAAATAAA TCAAAGGAGA CACTAGAAAA ACTTGATAAA
GGAAGTGAAA GAGGAAAATC TAAAAGAGTT GGGGCCAATC AGCATACGAG CATAAGCTCT
GTTTCTACAG CACAGCGAAG AATTATCAAA GAGATAGGAA ATTTAGGATT AGACATTTAT
ACTCAAGAAA TTATAACTCC TGAAAATATG CTTGCTTTAA ATGATTTAAG AACCATATGG
AAACTATTGT TTTCCGGAGT CAATTCTTTA AAAGCATATC AAGATAAAAA AGAAGATATT
ACAATGAGCA TAACTCTTTT TGTAAATGAC TTGAGAGCTT TAAATAAGAA AAACTATGAT
ATAAATGATA AGTTATACTT AATAACGGCA GCCAGTGAGG CGGTTGAATA TATCAAACAT
ATACTTGATG ACGAGTTTTT TTATGAGGGT TCGATTGAAG ATAAAGTAAC GTTGGCAAAT
CTATTTTATG AATTAAATTC AGGAATTGGA ATGGATAATT TATTGCTCTG TTCTATCCTC
AAAACAGGAT ACCCACAATT TGCAGAGCAA AATCCTCAAA TAATTAAAAA ACATATGGAT
GGGTATAAAA ATGCTCGAAA TCAATTGAAT CTTCGATTTG GTATACCTAA GAACAAGATA
GCGACGATAT ATAAGGATAC CCTTGCTACA GCATACACCA ACCTTACATT TGAACAGGAA
ACTCCCAAAA CAAGGATTAA AGCAAGCGAA CGTGTACTCG AAGTACAGCA ACAAAATAAG
TGTATTCGAG GCAAAAGAAT AGCAGAAAGT GTTCCTACTC AAACTCCTCT TTTTAAATTT
TCTACTGAGA GCCTGAAATT AAGTAATAAT ATATACAAGA TAAAAGATCC AAGAAATATA
CCTGCTACAG AACTAGAAGA GTTATATAGC CAACTTATAA AGGTAGAAGG GTTGAGCCTT
TCACTATTGG TTTTTAACCA ATGTTCTCCT ATACAACTTG TAGGCAATCA TTTGGATTTT
ACAGTTGATA ATTTAATTAG CTCTTATAAA AACATCCTTG GCACATTTCC TAAATATGAT
ATTTATAATT TTTTAAAATG GGGGATTATC ATCTATATGA AAAACAACCG AACTGCTGAA
GCATTAATTC GGTTAAAAGC TATGAAAGTT TTCTATGATT ATTTTTCAGA AGATGCCAAG
ATAAAATTTG AGAGAGATTT TAAGATTTTT CAAGCGAATG CATATGCTGC ATGTGGGGAA
CACGATAAAT TATCTCAACT TTATAAAGAA AAAGTTCAGG CAAAACTAGA AAAGGAAAGA
ATTCTTAAAG AAAACCGTAA GAAGAGTGTA CAAAAATTTA AAAATGCTCA ACAAATGGTG
CAATTGGAGC AACCACATTC AGGGCCACTT GCTACCACAA AAGCAGTAAG AAAACAAGTA
TTAAATACAG ATCCTTCCAC TACTATAAGT GAAAAAGTAT ACCAGGACGA GCAACAACGT
AAAAAAGAAG AAGCAAATGC TAGAGCAAAA AGGCATCAAG AAGCAGAAGA GGTGCGTCTG
CAAAAACGAT TAGAAAATAT ATCTCTTAGT AAGGAAGAAA ATGAGAATTT TATTCCTCCT
TCTCGAAATG GAGAATTAAG AGGACAGCTA ACAGATACAA TCTTTTCTAA AAATTCTTCT
TCTGTCCATT TTATTCTTCC ACAAAAAGCA TGTAAGACAC TTAATAAAAT CTTTGCTAAT
AATTGGAATA TTAGTCGTAA GGATATTGAA AATCTATTCC GTGTTTTAGG ACAAACTATT
AATACAAGTA CTAAATCATC TCATCATGTA ATTGAAATCG ACCAAGAAAC ATTTTTCCTA
GTTAATGATG CTGGAGATAC AATTGATGTA ATTACTGATT CATCAGGTTA TATGAGCGGC
CATTTAAGTT TACCTAATTG GAAAGAGAAA GTTAAAAAAT ATATGCGAAA GAAAATTTTG
CGCGTACTAT GTCATATAGG GATCAATGAA CATAATTATT GTAAGCATAA TACAGTTTAA
 
Protein sequence
MKHSYTLNWQ FIVHILFISL CVQSCSGLNN LPVGIPGPTN HKQRLNEHNI HPLLVQTPID 
QRGHVVTFYQ EGSQLQAEVE EKNGCFSKIH TLPVYIEQGI NLQEVNSKQH IHVILPKGQE
TGYVYVGHTG LMGGGESKDE GEEEDNENQK NEKAKSQQKN KSKGKEKLNK SKETLEKLDK
GSERGKSKRV GANQHTSISS VSTAQRRIIK EIGNLGLDIY TQEIITPENM LALNDLRTIW
KLLFSGVNSL KAYQDKKEDI TMSITLFVND LRALNKKNYD INDKLYLITA ASEAVEYIKH
ILDDEFFYEG SIEDKVTLAN LFYELNSGIG MDNLLLCSIL KTGYPQFAEQ NPQIIKKHMD
GYKNARNQLN LRFGIPKNKI ATIYKDTLAT AYTNLTFEQE TPKTRIKASE RVLEVQQQNK
CIRGKRIAES VPTQTPLFKF STESLKLSNN IYKIKDPRNI PATELEELYS QLIKVEGLSL
SLLVFNQCSP IQLVGNHLDF TVDNLISSYK NILGTFPKYD IYNFLKWGII IYMKNNRTAE
ALIRLKAMKV FYDYFSEDAK IKFERDFKIF QANAYAACGE HDKLSQLYKE KVQAKLEKER
ILKENRKKSV QKFKNAQQMV QLEQPHSGPL ATTKAVRKQV LNTDPSTTIS EKVYQDEQQR
KKEEANARAK RHQEAEEVRL QKRLENISLS KEENENFIPP SRNGELRGQL TDTIFSKNSS
SVHFILPQKA CKTLNKIFAN NWNISRKDIE NLFRVLGQTI NTSTKSSHHV IEIDQETFFL
VNDAGDTIDV ITDSSGYMSG HLSLPNWKEK VKKYMRKKIL RVLCHIGINE HNYCKHNTV
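
The gene and protein sequences above are related through translation table 11, as listed in the Gene Information fields. The following is a minimal sketch of that relationship, assuming the standard NCBI layout of table 11 with codons ordered over the bases T, C, A, G. The helper names (clean, translate, gc_percent) are hypothetical and are not taken from the source database. Whitespace is stripped first because the record prints sequences in blocks of ten.

```python
from itertools import product

# NCBI translation table 11 (bacterial, archaeal and plant plastid code).
# Amino acids are listed for codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence above (60 bases, 20 codons).
first_row = "ATGAAACATT CTTATACTCT AAATTGGCAA TTTATAGTCC ATATTTTATT TATAAGCTTA"
print(translate(first_row))  # MKHSYTLNWQFIVHILFISL
```

The printed residues match the first 20 residues of the protein sequence above. Passing the full gene sequence to translate and gc_percent would give values to compare against the Protein Length and GC content fields.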