Gene Aasi_1878 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1878 
Symbol 
ID6377375 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1602558 
End bp1604978 
Gene Length2421 bp 
Protein Length806 aa 
Translation table11 
GC content33% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003573238 
Protein GI294661362 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTCG TTGCTGTTTC TTGTAAAGAC TGCAATAATG GCAATCCTGC ACATACTCCT 
AATCCTGATA TAGAAGATAC AAAACCTGTT TACCAATTGA TTTTAGATGG TATAGGTGCT
TCAGCTTCTT TAGAAGGCAT TGAAACATAC TCCTTTTTTA TAGTAAATAA CGATAACCAA
AATACAGTTC CTTCAGATGA AATCTTACTC TCATTAGAAA GTGAGATAGA ATTTACACTA
AATAATTATT CAGTAGATAG TCAAGGGCTG ACACTCAAAA AAATTCTAGA AGTAGACAAA
ATTAAACCAA GTGCTCCTCA AGAAGTTATT TTAAAATTAA AGAATGCTAA AGATAGGAAC
GTTGCTGCAT CCATTAAGTT ACAATTAAAA GACAAGACAG GTAATAAAAT AGGAGAGGAG
AAAAGCTTAG TATGGAGGCC TAAAGTAGGA ATTCAGTTAG ACCTACAGAT AACTAATAAA
AACTTGAAAG GTAATGAGAA GGAAAATAAA AAAATTGAAT TTAAGGTAGC TTCATTGGGT
ACAATCATGC CTGATAAAGA TGCAATAAGT TTAAATTTAA TACCTGAAGA TGGTATTACT
GCTACCATTG TAGGTGCTAG CCAAGCAACT GTGAATGGTA AAATAATTTA TACTTACCAA
GTTAAAAAAG AAGATATAAA TAAAACAATT ACTTCATTGA GTATTGACCC ACAAGGAAGT
AAACAAGCTA GCTTTAAAGT ACAGCTTGTA TATGATGGTA ATCTGGTTGG GCCTGAACAA
ACTTTGACAT GGCAAGCTGA CGAACCATTA GCATTTGCCT TTGAAGCATC AGAAGAAAAA
TACAAGGATA TTGTAACAGG AATGAAATCA TTAATAGGTA CAGATATAGT GGAGATATCT
ATTAAAAACT TAGGGAAAGA TACAGAAGAC GACGAAGTTC TGTTATGTGT TGAACAGGAT
AGCCCAGTAG AGGAGATAGC GTTTGAAGTA TATTATAACT ATCAAAATGT TGACCAAATA
GGTGCCGGAA CCCCTTCCAT AGCTTTTAAA AGTCAGAAAA AAGTAGATGT GGAATTGTTA
AATCTTTCTA ACGATGGTAC TGCTATTAAA AAAGGAGATA CTATTAAAAT TGCATTGCAA
CTAATAGACC CTAAAGTTAA GCAACAGGCT AGTATTACTT TTAAAATGAA AAGTAAGAAA
GATAATCAAG ACATAGCTAC CCCAGTAACT ATTAATTGGA GGGCAGTTAC AACTACTTCG
AAAAGTATAG AAGTCACTCA GCAGATGCTT AATGAAATCT TTTCCAAACC TGGATGCAGA
AGGTTGTACG ATGTGCTAAA AGATATTAAA GATGGTAAAG TTGGACAAGA ACTTGATATT
AATAAAACAG ATTCTCACCA TAGATTAGGT TATACAGCTC TACTAGAGGC TATTAATATA
GGACGAGAAG ATATAGTAAC TTTATTATTA GACAAAGGTG CTGATGTGAA TACACCAAGT
ATACAAGGTG AAACGCCAAT TCAAGTAGCT ATTAGAAAGC TTGATGTAGA AATTGTGAAC
TTATTATTAG AACAGAAGAA GCTTGATTCA AATATAACTT ATGAGAAAGG TAAGAAATTG
CTACTGAATT TAGCTATAGA AAGCAAGAAT GTAACAGGAG ATATAGAAGA AGTTACTAAG
ATAGCTGATA TGTTATTGGA TAAATTAGAT ATAGATACTA TCATTCGAAT TAATGGAAGT
GGGCAAGAAT CTCCAATTTT ACTGGCTATC CAATGTAGGC GTACCCAGTT GGTAAAAAAA
CTTTTAGCAA AAGGCTTTAC TCCTGATTTA AAAAATAAAC AAGGGGAAAC TGCAATCCAT
TTGGTTGCTC GGTATAATCA GAGAGAGTTA GCTGAACAAT TGATAGCAAG AAATGTAGAG
CTAGACGTGC GGGATAATAT AGGAAATACT CCTCTCCACA TTGCTGCAGC CCTACCTAAA
AATAAAGAGA TAGCTAAGCT ATTGATTGAT AAATTTCAAG AGAAAGGAAT TAGTTTAGAT
TTAGTTAATC AGCTTGGCCA AACACCTCTG CACAAAATAG CTGGCAATTC TAATGCAGAA
AATATAGAAA TTATGGAAAA CTTATTGCAA GCAGGTGCCC AGCCAAATGT ACAGGATAAA
AATGGCAGTA CTCCACTCCA TTATGCTATA GGTGCTAAAT ATAGGAATAT TATTGAGGAA
TTAATAAGAG CAGGTACTCA GATGGATATA CAGGATAATC AAGGAAATAC ATCTTTACAC
TTACTAGTTG CTAACAATTA TGTGGACATA GTAAGAAGTG TAATAGCTAA GAGTCCTAAT
CTTAAGAATA TAAAAAACAA GGCTGACAAA TTGCCTAAAG ACTTGGCTAC AACTCCTGAA
ATGAAGGCCT TATTTAATTA A
 
Protein sequence
MSLVAVSCKD CNNGNPAHTP NPDIEDTKPV YQLILDGIGA SASLEGIETY SFFIVNNDNQ 
NTVPSDEILL SLESEIEFTL NNYSVDSQGL TLKKILEVDK IKPSAPQEVI LKLKNAKDRN
VAASIKLQLK DKTGNKIGEE KSLVWRPKVG IQLDLQITNK NLKGNEKENK KIEFKVASLG
TIMPDKDAIS LNLIPEDGIT ATIVGASQAT VNGKIIYTYQ VKKEDINKTI TSLSIDPQGS
KQASFKVQLV YDGNLVGPEQ TLTWQADEPL AFAFEASEEK YKDIVTGMKS LIGTDIVEIS
IKNLGKDTED DEVLLCVEQD SPVEEIAFEV YYNYQNVDQI GAGTPSIAFK SQKKVDVELL
NLSNDGTAIK KGDTIKIALQ LIDPKVKQQA SITFKMKSKK DNQDIATPVT INWRAVTTTS
KSIEVTQQML NEIFSKPGCR RLYDVLKDIK DGKVGQELDI NKTDSHHRLG YTALLEAINI
GREDIVTLLL DKGADVNTPS IQGETPIQVA IRKLDVEIVN LLLEQKKLDS NITYEKGKKL
LLNLAIESKN VTGDIEEVTK IADMLLDKLD IDTIIRINGS GQESPILLAI QCRRTQLVKK
LLAKGFTPDL KNKQGETAIH LVARYNQREL AEQLIARNVE LDVRDNIGNT PLHIAAALPK
NKEIAKLLID KFQEKGISLD LVNQLGQTPL HKIAGNSNAE NIEIMENLLQ AGAQPNVQDK
NGSTPLHYAI GAKYRNIIEE LIRAGTQMDI QDNQGNTSLH LLVANNYVDI VRSVIAKSPN
LKNIKNKADK LPKDLATTPE MKALFN