Gene Aasi_1520 details

Gene Information

Locus tag: Aasi_1520
Symbol:
ID: 6377752
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 306702
End bp: 308489
Gene Length: 1788 bp
Protein Length: 595 aa
Translation table: 11
GC content: 36%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003573007
Protein GI: 294661132
COG category:
COG ID:
TIGRFAM ID:
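
The coordinates and lengths in the record above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration only.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
START_BP = 306702
END_BP = 308489


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))
```

Under these assumptions the computed values agree with the Gene Length (1788 bp) and Protein Length (595 aa) fields listed in the record.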


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAGC CGCTTGCCAG CTCCTTTTTA CAGTTATTAC TTTACATGCG CCCTTTTCGC 
AGGATATATC TTTCTGCGAC TTTATATTCT TTTCTAAATA AATTATTTGA TCTCATGCCA
GAGATATTGC TAGGTATTGC TGTAAACACA GTAGTAGCTA GGGAACAATC ATGGCTAGCC
AATTTAGGTT TTTGTGACCT TAAGATACAG CTTATCTTGC TGGGCTTAAT GACAATGGTT
GCCTATGGGC TAGAATCATT ATCTGAATAT TTATTTTCTA TCCGATGGTG GAATCTTGCT
CAAATTGTAC AACATAATTT TAGAATGCAA GCCTTTGAAC ATGTTCAAAA AAGTACTATC
ACTTCTTTTT CTAAACAAAA AACAGGAAAT CTTCTTTCTA TTCTCAACGA TGATATCAAC
CAGCTTGAAA GATTTTTAGA AGAGGGTATA GATAAAATTA TTGAGGTTAT TGGTACTAGT
ATCTTTGTAG GTAGTATCTT CTTTTTCCTC GCGCCTCAAA TAGCAATATT TGTTGTTATT
CCTATCCCAA TTATTATATA CAGTACCTTT CGGTTTCAAA AAAAGCTAAG TCCATATTAC
CTAAATATAA GGGAAAAAGC AGGGCTTGTA GGTGCTTTTC TAGCAAACAG CCTATTAGGG
TTATTAGCAA CAAAAAGTTT AGTAGCCGAA CAACTTGAAA AAAAGAAACT AGAAAAAGCT
AGTATGGCTT ATAAAGATGC TAGCTTCAAT GCTATCCGTT GGGGAGCACT TTTGGTCCCT
ATCATACGCT TTGTTATATT GTCAGGTATC TTAGTAACCT TAATTTATGG GGGTAAATTA
ACGATAGAAC AAAAGTTAGA TGTAGGTGTT TACAGTATTC TTATATTCCT TACACAACGG
TTACTTTGGC CTTTTACAGA AATAGCAGAT ATCATGATTA ATTTCCAGCG CGTAATGGCT
TCTACCCAGC GCCTGTTAAA TTTATTTGAA TTACCAACCG AAAACTCCCC TGATAATATA
GTGCCCATTA AAGGAAGAAT TACATTTGAT GATGTTAGTT TTTCTTATCA TAATCATACA
CCTAGTTTGC ATAACCTTAC CTTTGCAACA GAACCCGGAC AGCATATAGC CTTCGTAGGT
GCCACAGGAG CTGGTAAATC CACCTTATTA CATCTGCTAT TAGGGTTTTA CCTACCTACT
TCAGGTAAAA TTTTCTTTGA TAATAAAGAA ATCCGAGAAC TTTCTCTTCC AGGCTTAAGG
AAACAGTTAG GTTTTGTAAG CCAAGAACCC TTTCTTTTTG AAGGTACTAT AGCTGAGAAT
ATTAGCTATG GTTATGTAGA AGCTACTCCT GAACAAATTA TAGAAGCAGC AAAAAATGCA
GCAGCACATG AGTTTATTAT GAGGCTTCCA GAAGGGTACG ACACAATAAT TGGAGAACGT
GGCCAGAACC TGTCAGGAGG GCAAAAACAA CGCCTTGCTA TTGCAAGAGC TATTGTACGT
AATCCAACTA TTCTTATTCT AGATGAAGCG ACTTCTTCGG TTGATAATGC TACTGAATTG
GCCATTCAAA GGTCATTATC TAAGATTGGG CAAGGAAGGA CAATGATACT TATTGCTCAT
CGACTTTCTA TGGTTAAACA TGCCGATAAA ATCTTTGTAT TAAAGAAGGG ATCAATTGCA
GAGCAAGGAA CACATGAAGA ATTGCTCCAG CATGATAATG TGTATGCTAA TCTTTGGAAG
CTACAAATGG GCGAAACATT AACACATCCT GAACTGATTA TTGATTAA
 
Protein sequence
MKKPLASSFL QLLLYMRPFR RIYLSATLYS FLNKLFDLMP EILLGIAVNT VVAREQSWLA 
NLGFCDLKIQ LILLGLMTMV AYGLESLSEY LFSIRWWNLA QIVQHNFRMQ AFEHVQKSTI
TSFSKQKTGN LLSILNDDIN QLERFLEEGI DKIIEVIGTS IFVGSIFFFL APQIAIFVVI
PIPIIIYSTF RFQKKLSPYY LNIREKAGLV GAFLANSLLG LLATKSLVAE QLEKKKLEKA
SMAYKDASFN AIRWGALLVP IIRFVILSGI LVTLIYGGKL TIEQKLDVGV YSILIFLTQR
LLWPFTEIAD IMINFQRVMA STQRLLNLFE LPTENSPDNI VPIKGRITFD DVSFSYHNHT
PSLHNLTFAT EPGQHIAFVG ATGAGKSTLL HLLLGFYLPT SGKIFFDNKE IRELSLPGLR
KQLGFVSQEP FLFEGTIAEN ISYGYVEATP EQIIEAAKNA AAHEFIMRLP EGYDTIIGER
GQNLSGGQKQ RLAIARAIVR NPTILILDEA TSSVDNATEL AIQRSLSKIG QGRTMILIAH
RLSMVKHADK IFVLKKGSIA EQGTHEELLQ HDNVYANLWK LQMGETLTHP ELIID