Gene Aasi_0813 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0813 
Symbol 
ID6377164 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1029471 
End bp1030931 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content37% 
IMG OID642681954 
Producthypothetical protein 
Protein accessionYP_001957917 
Protein GI189502200 
COG category[L] Replication, recombination and repair 
COG ID[COG0415] Deoxyribodipyrimidine photolyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.182333 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCAAC AGCCAATTAT ACTGGTTTTT TATCAAAACT TACGTTTCCA TGACCAGCCT 
TTACTACTTG AAGTGCTCCA ACATACCCAG CCTATCATTC CTATTTTTAT TAATGATCCT
AAAGTTATTG AGCGATTAGG AGAAGCCAGC CAATGGTGGC TATACCAGTC TATACGAGCA
TTTAAGCAAC AGTGGAAATC AGCCTATAAT ATTGAGCTTA TTCTTAGGAC AGGAGATAGC
GTAAGAGTTC TTCAACAACT GTTACAAGAA ACCAATGCCA ACAAGATTTA TTTAGGAAAA
CGCTACACAA AATTAGAAAG GGAAATAGAT GAAAGGATTT ATGAGGAGCT CAATCGAGAT
GGTATAACCA TAAAGTTTTT TAATACCCAC CTCTTATTTG AACCAGCCAA CATAAAAAAC
CAACAAGGAA ATAGCTTTCA AATCTTTACA CCTTTTTGGA AAACTTGTTT AACCAAAACC
ATAGAAGCTG ACCTACCAGC GCCTGATCAC ATCTTTAATG GCTATAACCA ACCCATTAAT
TCTGATGATT TAAGCGATTG GAAGTGGGGA CATAGCCAAG CAGCTTGGAC AAGAAAGTTA
GCCAACCATT GGCATCTATC TGAATTAGCT GCTTTAAATA AGCTAGCTAT ATTTCTTAAG
AATTCCTTAG CAGGCTATAA TAATAACCGA GATCTTATTG CATCACCCAG CTTTAGCTCC
CAGCTATCCC CTTATCTTAG ATGGGGACAA ATAAGCGCTA AGAAAATATT TAATGAAGTC
ATTCATACTA TGGAAAGAGA CCCAACTATC CAACAAGATG GGAATACTTT TTTGAAAGAA
ATAGGCTGGC GAGAATTCTC TTATTATCTA CTGTATCATC ACCCATCCAT GCAAGAAGTT
CCACTGAACA AGCGATTTCA GGACTTCCCT TATGAAAATA ATCTAAGCCT TTTAGAAAAA
TGGCAAAAAG GTACCACTGG CTTTCCTATT ATTGATGCTG GTATGCGCCA GCTTTGGCTA
GAAGGCTGGA TGCCAAACCG ACTACGGATG ATAGTCGCTT CATTTTTAAT AAAAGACTTA
TTAATTAATT GGCAATTTGG ACAGGCATGG TTTATAGACA CTTTAGTAGA TGCAGACCCT
GCCAACAATG CCAATAGTTG GCAATGGGTA GCCGGTTGTG GTACGGATGC ATCTCCTTAT
TTTCGCATAT TTAATCCTAT TACACAAGGA AAAAAATTTG ACTCAGAAGG GAAATATATT
AGAAAATATG TTCCTGAGCT AAAAGATTTA CCAACCAAAT ATATTCACCA GCCTTGGGAA
ATGCCCATTA CTTTACAAGA AGAGTATAAT GTAATAATTG GGAAAGATTA CCCTCACCCC
ATAGTAGATC ACACAATACA AAAAAACAAG GCTTTAGATT GTTGGAAAGA ATTCAAAGGA
AAAGGCTATC TCGCTGGTTA G
 
Protein sequence
MQQQPIILVF YQNLRFHDQP LLLEVLQHTQ PIIPIFINDP KVIERLGEAS QWWLYQSIRA 
FKQQWKSAYN IELILRTGDS VRVLQQLLQE TNANKIYLGK RYTKLEREID ERIYEELNRD
GITIKFFNTH LLFEPANIKN QQGNSFQIFT PFWKTCLTKT IEADLPAPDH IFNGYNQPIN
SDDLSDWKWG HSQAAWTRKL ANHWHLSELA ALNKLAIFLK NSLAGYNNNR DLIASPSFSS
QLSPYLRWGQ ISAKKIFNEV IHTMERDPTI QQDGNTFLKE IGWREFSYYL LYHHPSMQEV
PLNKRFQDFP YENNLSLLEK WQKGTTGFPI IDAGMRQLWL EGWMPNRLRM IVASFLIKDL
LINWQFGQAW FIDTLVDADP ANNANSWQWV AGCGTDASPY FRIFNPITQG KKFDSEGKYI
RKYVPELKDL PTKYIHQPWE MPITLQEEYN VIIGKDYPHP IVDHTIQKNK ALDCWKEFKG
KGYLAG