Gene Aasi_0650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0650 
Symbol 
ID6376496 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp836885 
End bp840331 
Gene Length3447 bp 
Protein Length1148 aa 
Translation table11 
GC content38% 
IMG OID642681805 
Producthypothetical protein 
Protein accessionYP_001957775 
Protein GI189502058 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.394104 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCAGT TCACCCACTT ACATTGCCAT ACACAATATT CCTTGTTAGA TGGCACCGCT 
AAAATCGACA AACTGTTTGG CAAAGCTAAA CAGCTAGGCA TGCAAGCACT TGCTATTACA
GACCATGGCA ACATGTTTGG AGTACCACAT TTTGTAGCAC AAGCAAAAAA ACAAGGAATT
AAACCTATTA TTGGTTGCGA ATTCTACCTT GCAGCTGATA TGCATAATTT AAAAGAAAAG
ACACGCTATC ATCAGCTTCT ATTAGCCAAA AATGAGGTAG GCTATAAAAA TATAGTAAAA
CTCTGTTCTA TCAGCTTTTT AGAAGGATAC TATTATAAGC CGAGAATTGA TAAAGAGCTT
CTTAAAAAAT ATAGCGAAGG CTTAATTGCT ACCACCTGTT GCCTAGCAGG TGAAGTACCC
CAAGCTATTA TGCGCAAGGG AGAAGAAGAA GCTGAAAAAG TATTTCTATC ATGGCTTAAT
ATTTTCGGTG AAGATTACTA CATAGAGTTA CAGCGGCACG GGCTAAAGGA ACAAGATAAA
TGTAACGAGG TCTTGCTTAA ATGGGCTCAA AAGTATCAAG TAAAGGTTAT AGCAACTAAT
GATGTCCATT ATGTAGAGCA GCAAGACAGC CTAGCCCAAG ATATTTTGTT GTGTTTACAA
ACAGGAAAAG ATTATAACGA TCCCAATAGG ATGCGCTTTG ATGGTGACCA GTTTTTTTTA
AAGTCACCTC AGGAAATGCT AACGTTATTT CATGACATCC CACAAGCCGT TTCCAATACC
CAAGAGATTA TTGATAAAAT CAATACGCCC TCTTTAGAGC GTGATATATT ACTCCCTGTC
TTTCAGATAC CACAAGGTTT TACCAGCCAA GACAGCTACC TTAGACATTT AGCTATAGAA
GGTGCTAAAA AGCGTTTTGG TGCTATTAGT GCAGATTTAG AAGCTCGTAT CAACTATGAA
CTAGGCGTTA TACAACAAAT GGGCTTCCCA GGTTATTTTC TGATTGTTCA GGATTTTATA
CAAGCAGCTA AAAACCTACA AGTTGTCGTA GGCCCTGGCA GAGGTTCTGT AGCAGGTTCA
GTGGTGGCTT ATTGCATTGG CATTACGGAC ATTGATCCAA TACGCTATAA CCTTTTCTTT
GAACGATTCT TGAATCCTGA ACGAGTATCT ATGCCAGATA TAGATATCGA TTTTGATGAC
GAAGGTCGCC AAAAGGTAAT AGAATATGTA GTAGATAAGT ATGGTAAAAA TCAAGTTGCC
CATATTATCA CATTTGGTAG TATGGCAGCC AAATCAGCTA TTCGCGATGT AGCTCGTGTT
TTGGGTTTAC CTCTTGAAAG AACTAACTAT ATAGCCAAGT TAGTGCCTGA AAAGCTGGGC
ATTACTTTGC CAGAATCTTT TGAAGAAGTA CTAGAGTTAG CAGAACTTAA AAAGAATAGC
AATACACTTG AAGGGAAAGT ATTGTCCTTA GCAGAAACAT TAGAAGGTTC TGCTAGGCAT
ACCGGTGTAC ATGCCGCTGG TATTATTATT GCCCCTGATG ATCTTCTGAA TCATATTCCT
GTAAAAACGG ATAAGAACTC AGACCTTTTG GTAACCCAGT ATGATGGCTC TATAGTAGAA
AAAGTAGGTA TGCTTAAAAT GGACTTCTTG GGCCTAAAAA CTTTATCTAT TATCAAAGAT
GCCATAGCAC TTATTGAAAA GCTTCATGGC ACAAAGATCG ACCTTGAGCA GCTACCGCTA
GATGATCTTA AAACCTTTAA GCTATACCAA CAAGGCGATA CAATAGCTAC ATTCCAGTTT
GAGTCTGAAG GAATGCGCCA ATGGCTAAAG AAGTTACAAC CTACAGAGTT TGAAGAGCTT
ATTGCCATGA ATGCATTATA TCGGCCAGGG CCCATGCAAT TTATCCCTAA TTTCATAGCC
CGTAAGCATG GACAAGAAAA AATTGACTAT CCCCACCCGC TGCTAGTAGA TATTTTAAAA
AATACCTATG GCATTATGGT ATACCAAGAG CAGGTTATGC AAACTGCTCA AATTATTGCT
GGCTATAGCT TAGGAGGAGC AGACCTGTTG CGTAAAGCCA TGGGTAAAAA GCAGCCAGAA
GAAATGGCTA AGCAACGGGA AATATTTGTA CAAGGAGCAC AAGAAAAAAA TAACCTCCCT
AGCACAAAAG CCATCGAGAT CTTTGAGGTA ATGGAAAAAT TTGCTCAATA TGGATTTAAT
AGAGCCCACT CAGCTGCCTA CTCTGTTATT GCTTATCAGA CGGCATACCT TAAAGCCCAC
TACCCTGCCG AATATATGGC ATCTGTGCTA ACACATAACC AAAATGATAT TGGCAAGATC
AGCTTTTTTA TGGAGGAGTG TATTAGACAA GGCATTCAAG TGCTAGGGCC TGATGTTAAT
GAGAGCCAAG TTAATTTTGA TGTTACACCA AAACAAGGAA TTAGGTTTGG ATTAACAGCT
ATCAAAGGAG CTGGAGAAGG GGCAGTAAGC CATATTATAG CAGAGAGAGA AAAAAACGGA
CATTTTACAG ATATCTTCTC TTTTGTAGAA CGGGTAGATT TAAGAATGGT CAATAAAAAA
ACCCTTGAAT CCTTAGCCAT GTCTGGTGCT TTTGACGGGT TTACAGGCTG CCATCGCAGA
CAATATCTTT TTGCAGCTGA TGGAGAACAT AATTTGCTTG TAAAAGCCAT TCAATATGGC
AACCAAATAA AACAAGAAAA AGCCTCAGCA CAACAGTCGT TATTTGCTAT GGATGATAAC
TTCCAATATA TTCAAAAGCC ACCTATACCA ACCTGTGAGC CTTACGATAA GATTGAAAAG
CTGCGCATGG AAAAGGAATT AGTAGGCTTC TATATCTCAG GACACCCTTT AGACCAGTTT
AGAGTAGAGT TAGCCAATTT CTGCGATGGC CATACTCAAA ATATACTTAC GTTTAAACAA
AAAGAAGTTA GACTAGCAGG CATAATAACA GAGTGCAGTA TTAAATACAA TAAGCAAGGA
CGTCCTTTTG GCTTATTTAT TTTAGAGGAT TATCATGGTA CACTCGATCT AGCACTCTTT
GGAGAAGATT TCCTTAAAAA TCAGCATATG CTACAAAAAG GAATGTTTGT ACATATTACA
GGAGTAGTTA CAGAAAGATA TAATCAGCAA GACACATGGG AATATAAGCC TCAGAAAATT
AGCTTATTGG GCGAGTTAAG AGATAAACTA GGCAAGAATT TAAGTATTGC TGTACCTATT
GAAAAGATAA CACCTCAATT TATTACAGAA TTAAGGAGCC TCGTCACTAA AAATACAGGT
AATTGCTGTT TAAAATTACA TATAGCCGAT CCGAGCGAAG CCATGCAAGT ATCATTGCAA
GCTTTCAAGT ATCGAGTTTA CCCTTCTGAT GAGCTCTTAC ATACACTCAG CCAGCTCACA
GAGAGCTCTT ATCAACTTAC CCATTAA
 
Protein sequence
MIQFTHLHCH TQYSLLDGTA KIDKLFGKAK QLGMQALAIT DHGNMFGVPH FVAQAKKQGI 
KPIIGCEFYL AADMHNLKEK TRYHQLLLAK NEVGYKNIVK LCSISFLEGY YYKPRIDKEL
LKKYSEGLIA TTCCLAGEVP QAIMRKGEEE AEKVFLSWLN IFGEDYYIEL QRHGLKEQDK
CNEVLLKWAQ KYQVKVIATN DVHYVEQQDS LAQDILLCLQ TGKDYNDPNR MRFDGDQFFL
KSPQEMLTLF HDIPQAVSNT QEIIDKINTP SLERDILLPV FQIPQGFTSQ DSYLRHLAIE
GAKKRFGAIS ADLEARINYE LGVIQQMGFP GYFLIVQDFI QAAKNLQVVV GPGRGSVAGS
VVAYCIGITD IDPIRYNLFF ERFLNPERVS MPDIDIDFDD EGRQKVIEYV VDKYGKNQVA
HIITFGSMAA KSAIRDVARV LGLPLERTNY IAKLVPEKLG ITLPESFEEV LELAELKKNS
NTLEGKVLSL AETLEGSARH TGVHAAGIII APDDLLNHIP VKTDKNSDLL VTQYDGSIVE
KVGMLKMDFL GLKTLSIIKD AIALIEKLHG TKIDLEQLPL DDLKTFKLYQ QGDTIATFQF
ESEGMRQWLK KLQPTEFEEL IAMNALYRPG PMQFIPNFIA RKHGQEKIDY PHPLLVDILK
NTYGIMVYQE QVMQTAQIIA GYSLGGADLL RKAMGKKQPE EMAKQREIFV QGAQEKNNLP
STKAIEIFEV MEKFAQYGFN RAHSAAYSVI AYQTAYLKAH YPAEYMASVL THNQNDIGKI
SFFMEECIRQ GIQVLGPDVN ESQVNFDVTP KQGIRFGLTA IKGAGEGAVS HIIAEREKNG
HFTDIFSFVE RVDLRMVNKK TLESLAMSGA FDGFTGCHRR QYLFAADGEH NLLVKAIQYG
NQIKQEKASA QQSLFAMDDN FQYIQKPPIP TCEPYDKIEK LRMEKELVGF YISGHPLDQF
RVELANFCDG HTQNILTFKQ KEVRLAGIIT ECSIKYNKQG RPFGLFILED YHGTLDLALF
GEDFLKNQHM LQKGMFVHIT GVVTERYNQQ DTWEYKPQKI SLLGELRDKL GKNLSIAVPI
EKITPQFITE LRSLVTKNTG NCCLKLHIAD PSEAMQVSLQ AFKYRVYPSD ELLHTLSQLT
ESSYQLTH