Gene Aasi_0738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0738 
Symbol 
ID6376765 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp946872 
End bp949889 
Gene Length3018 bp 
Protein Length1005 aa 
Translation table11 
GC content34% 
IMG OID642681884 
Producthypothetical protein 
Protein accessionYP_001957850 
Protein GI189502133 
COG category[R] General function prediction only 
COG ID[COG0666] FOG: Ankyrin repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAGAC TGATACTGTC TCACCTAGTA ATAAGTTTCT TTCTATTATT AACATTAGGA 
TGTGGTGGCA ATAACCCTAA CCCTGCAATT ACAAATGATT CGTCAGAATT AGTGAATGAC
GATAGTTTAT CACTGCCAAA TACCCTACCT ATATCGCCCA TTAATAACCC AGTAATAGCT
AGTTCACCAA GCTTACCCGT AGAGACTGAT GCCGTGGCTG TACCAAGTAA TCAAAATACG
ATCGAAGAAG ATGTTTCAAA TGGTTCTAAT ACAATGACAG TTGATGAACG GAAGGCAAAT
TTAGGCTTAC TTACCCCACA ACCTAGGGAT CAACTAATCC ACTTTCTATC TCCTAGAGAG
AAGATAAATT TAGGTTTAAC TATCCCACTC TTGGCCGAGA TCATCCCCTA TATGTCTCCT
GGTCTGATAA ATAAACTCAA AAGAGTAAAT GACTTTTTAA ATATGCATAT AACCGAGCTA
ATAAAAGAAA AAATAATTAA GGATCTATAT GAAAAATATG ATGAAAATAA AGATACTAAC
GCAACCTTTT TACAGCTAGC TGTAAGAAAA GGAAATATAG AAGCAGCTAA ATTCTTAATA
GGTAAAAATA GTCTAAATAA TAGAGATGAA TATCATAAAA CTCTTCTACA TGAAGCTGTT
ACGAACGAAC ATATAAATAT GGTCGTATTT TTAATAGCAA AAGAAGCTGA TATAAACACT
AAGGATAAAG ACGGCAATAC TCCTCTCGAT TTAGCCTTTG AGCATAAGAA TATAGAAATA
ATGAAATTAC TCTTAAAAAA AGAAGGTAAA TTTCGAGATG ATGCTGATGA CAAGAAAAGA
AGCCATTTGT TGAAAATTTT AAATAATGAT AATAGGCCAC TTGTAGTAAT GGGGCTAACC
TTACTGCACT TATTTAATCA TAATAAGGAA TACACCTCAA AGACGAATGC CTCACAGGAT
GCTATTGATA CAGGAAATAG CAACCATGTA AACACATCTC CATATATAAA CGCAAGTGCT
TTGCACCTTG CTATATTAGA AGGTAATTTA GAAACAATTA AGTTACTAAT AAATCAAAAA
GCAGACATAA ATTCAAAAAT CGGAGAGAAC TATACACCTT TACATGTAGC TGCTTACATA
GGAAGAAAAG ATATAATAAA ATTATTAATA GATAGCAATG CTAATATCCA TGCTAAGTGT
AATGATGGTA ATACCCCCTT ACATTATGCT ACTATGCTCA GTCATATAGA AGCAGCTAAC
TTATTATTAG AACAGGAAGC CGAGATTGAG ATGCCAAATG ATTTATGGGA AACACCACTA
CATATAGCTG CTGAACAAGG CCACTTAGGA ATGGTTAAGT TATTAATAGA AAAAGGAGCT
GACTTTAACA CGCAAGACAA AGAGGAAGAA ACACCTTTGT ATAAGGCTGT TAAAGGTGGA
AAGATAGAAG TAATTAAATT TTTATTATTT GAAGGAGCAG ATATAAATAC AAAAAATATA
CATGGTTATA CACTCGTGCA TATAGCAGCC GAAAAGGGGC ACTCAGATAT ATTGATGTTT
TTGTTAAAAA ACGAGAATAT ACATGTACAA GTTAGAGATA ATCGTAATCA AACTCCATTG
CATGTAGCTA TTGGTAGTGG CAATTTAGGA GTAGCAGGAC TGTTACTAAA TTATGGTGCT
AGCATGTGTG ATAGAGATGA TCAGGGAGCT ATTCCTTTAC ATTTAGCTGC TTTAAATGGC
AACATGGAAG CAGTTAAGTT GCTAACAAGC ATAGGCCCCT TACCCCAACA TATAATTGAA
AATGAAGAAT CAACCACACT AATTATACAA ACAAGGTTAG GCATAAATAC GAACAATGAG
CTTGGATGTA CTCCCTTGCA CCATGCTGCT AGCAATGGCT ATATAGAAAT AGTCCAATTA
TTACTAAAAA AAGGAGCAGA TATAAATATT AAGAATAAGG AAGGGTTTAC TCCCTTATAC
TTGGCAGTCA TGAATAATAA TGATATACAT TTGATAACAA CTTTAATAAA GACAGGAGCT
GATATTAACA TTCAAGATAA CCAAGGTAAT ACCGCTTTGC ATTTTATAGT TCAAAAAGAG
CGTTTTGAAT TAATTAGATA TTTTCTAAGT AATGACCCTA ATGTTAATAT TAAAAATACA
AAAGGGCAAA CTCTTTTGCA TATAGCTACC CAGCTGGGCA ATATAGAAAT GGTTAAAAAA
TTAATAGATA AAGGGGCTGA TATTAGTATT CAAGATAACC AAGGTAATAC TGCTTTGCAT
TTTATGTTTC AAAAAGAGCG TTTTGAATTA ATTAGATGTT TTCTAGATAA TGCACCTAAT
GTTAATATTA AAAATACAAA AGGGCAAACT CTCTTGCATA TAGCTACCCA GCTGGGCAAT
ATAGAAATGG TTAAAAAATT AATAGAAAAG GGAGCCAATG TAAATATTAG CATAAACCAC
CATGGGCAAA CCCCTTTACA TCTAGCTCTT GAAAAAGGAT ATACAGGAAT AGCTAGACTT
TTAATAGAAA ATGGCGCTAA TCTAAATGCC AGGTATAAAT ATTTTAATAC ACCAGTCCGT
TTAATTCTTA AAAAAGGATA CACAGAATTA GCTGGTCTTT TACTAGAATC GGCAGATAAG
CAACGTAATA GCCCCCTACA TCTGGCTGCT CAAGGAGGTT ATACAAGAAT GGTGCAACAT
TTAATAGATG CAGGCGCAAA GATTAATTTA GATATTGATT TTACGAATCG AGATGGCAGA
ACACCATTGC ACTTATCTGC AAAACATGGC CATAGAGCTA TAGTCCAATT ATTACTAGAT
GCAAATACTA ACATTGATGA ACAAGATTGT TTTGGGCTTA GTCCTTTACA TCTAGCTGCT
CGAGAAGGCC ATCAAGAAAT TGTTGAATTA CTAATAAGAG TAGAGGCAGA TCTTAACCTA
CAAAATAATG CTGACCATAC AGCCAGAGAT TTAGCTATTC AAAAAGGGCA TACGGCTATA
GCAGGCTTAT TGCCTTAA
 
Protein sequence
MQRLILSHLV ISFFLLLTLG CGGNNPNPAI TNDSSELVND DSLSLPNTLP ISPINNPVIA 
SSPSLPVETD AVAVPSNQNT IEEDVSNGSN TMTVDERKAN LGLLTPQPRD QLIHFLSPRE
KINLGLTIPL LAEIIPYMSP GLINKLKRVN DFLNMHITEL IKEKIIKDLY EKYDENKDTN
ATFLQLAVRK GNIEAAKFLI GKNSLNNRDE YHKTLLHEAV TNEHINMVVF LIAKEADINT
KDKDGNTPLD LAFEHKNIEI MKLLLKKEGK FRDDADDKKR SHLLKILNND NRPLVVMGLT
LLHLFNHNKE YTSKTNASQD AIDTGNSNHV NTSPYINASA LHLAILEGNL ETIKLLINQK
ADINSKIGEN YTPLHVAAYI GRKDIIKLLI DSNANIHAKC NDGNTPLHYA TMLSHIEAAN
LLLEQEAEIE MPNDLWETPL HIAAEQGHLG MVKLLIEKGA DFNTQDKEEE TPLYKAVKGG
KIEVIKFLLF EGADINTKNI HGYTLVHIAA EKGHSDILMF LLKNENIHVQ VRDNRNQTPL
HVAIGSGNLG VAGLLLNYGA SMCDRDDQGA IPLHLAALNG NMEAVKLLTS IGPLPQHIIE
NEESTTLIIQ TRLGINTNNE LGCTPLHHAA SNGYIEIVQL LLKKGADINI KNKEGFTPLY
LAVMNNNDIH LITTLIKTGA DINIQDNQGN TALHFIVQKE RFELIRYFLS NDPNVNIKNT
KGQTLLHIAT QLGNIEMVKK LIDKGADISI QDNQGNTALH FMFQKERFEL IRCFLDNAPN
VNIKNTKGQT LLHIATQLGN IEMVKKLIEK GANVNISINH HGQTPLHLAL EKGYTGIARL
LIENGANLNA RYKYFNTPVR LILKKGYTEL AGLLLESADK QRNSPLHLAA QGGYTRMVQH
LIDAGAKINL DIDFTNRDGR TPLHLSAKHG HRAIVQLLLD ANTNIDEQDC FGLSPLHLAA
REGHQEIVEL LIRVEADLNL QNNADHTARD LAIQKGHTAI AGLLP