Gene Aasi_0228 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0228 
Symbol 
ID6376696 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp255728 
End bp257980 
Gene Length2253 bp 
Protein Length750 aa 
Translation table11 
GC content37% 
IMG OID642681416 
Producthypothetical protein 
Protein accessionYP_001957401 
Protein GI189501684 
COG category[R] General function prediction only 
COG ID[COG0666] FOG: Ankyrin repeat 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.871268 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTAAAA GGAAATATCC CGTAGGTTGT CAACTCATAA TCTACATATT ATTACTTGTA 
AGCTTGTGCC TACAAAGCTG CTCCCATTCT ACCAATTTAC CTTGTGCTTC TGTTAAGGAA
GAATCGGTAG TACAAACACA AGAAACAATG CATCAAGCAG GCATTTCTTC GATACTTAAC
AAAGAGTTTA GTGCACAAGC GGGCCATATC CTTACTTTCT ATAAAGAAGC TGGTCAGTTG
CAAGCAGTTG TAAAAGAAAA CTGCCCTATC GGCTTTAGTA AAACACATAT TTTACCTGTA
TATATTGAGC AAGGGGCAGA ATTATCGGAT CTACTTCGAT TAGGCGAGCA AGCACAACAA
CGACGTATTC ATATTCAATT AGCACAAGGA CACCAGCCAG CTAAAGTAGT TATCTATAAA
GGAGCAGGGT TGATGGGGGG CGGTAAAAAA AAGAAACAAT TAACTGATGA GCAGAAAGCT
ATACAAAAGG AAAAGAGGGC CAGAAAGGAA GAAGAGAAAG CTAAGATGTG TAAAGAGGAG
AGATCCAGAA AGAGAGAAGA GGAAGTTAAG CTGAAAAAAG AGAAAGCTAG GTTGAGAAAG
GAGGAGAGAG CTAGAAAGAG AAAAGAAGAG AATGTTAGAC GGGAGGCCGA AAAGGTAGCT
CTTAAAAATA CCCCTTTACA TGAAGCTATT TTAGAGGGGA ATGCTACAAA GCTTTATGAG
CTGGTTCATT CAGGAGCAGA TATATACGCT AAGGGTAGGT ATGGAACTAC TCCCTTACAG
TTGGCAGTTA GGAAGTCTGA TGTAGAGTTA ATAAGCTTAT TATTGGACCA AAGGGCAGAT
ATAAATAAAG ATAAGATATC TAAGCTTTTA TACTTGGCTA TCAGAAGATC GGATGTGGAA
GTAGTAAATC TATTATTAGA ATATGGGGCA GATATAAATT CTCGGGAGCA TAATGGAATT
TCTCCATTAC ATGTAGCTGT TGATGAGAAC CGAACGGAAG TAGTAAAGCT ATTATTAGAA
CAAGGAGTAG ACTTAAATGT TAGGAATAAT TACCAAAATA CACCATTACA TTGGGCTATT
AGAAAGGGTT ATATAGGAGT GGCTAAGTTA TTGGTACAGT ATGGGGCAGA TATAAATGCT
CAAGGAGAAT ATGGTGCTTC TCCATTACAC ATAGCTGTGG CAGAAAGCCA AATGGAATTA
GTGAAGCTAT TTTTAGAACA AGGGGCAGAT ATATATGTGA AGGGTAAGTA TAATGATCTT
GTATTACATT GGGCTGCTGC TCGAGGGAAT GTGCATATAA CCAAGCTGCT ACTAGAGCAT
GAAGCATATA TATGTGCTAA ACTTAATTGG GATATTTTTC AAAGATCTGT GGAGTCAGTA
CATTCCTTAG TAGAACGTGA ATTTGCTATA AATAGTAAAG ATGACTCTGG TGATACACCA
TTGCATAAAG CGGCCAGAAA TGGACACTTA GAAGTGGTAG AGATATTACT AGACCAAGGA
GCTAACGCAA ATGCTACAAA TATCAAGGGG CTAACTCCTT ATCAGGTTAC TAAAGAAGTT
AGCATATTAA CCGTATTAGG TGCTCTATCT ATAACAGATG TAAATGTTTT GGATAAAAGA
GTTGAAAAAG AAAATGTTTC AAACGAGTTA AAAGAAGCAC ATAAGGGCGG TGTAGATGTA
ACTTCAGCAA GTGCCAATAG TCTTGCGAAA GATAAATATG ACGATACGGA TATTATAAGA
AAAAATAAAT CCTTACAAAA AGCTATTGTA AGGGGAGATG TAAAAAGAGT CAGTAAGTTA
ATAAATATAG GGTTAGATAT TAATGCCAAA AATATAGATG GCAATACACT TTTATATTTG
GCTGCACAAA ATTCTTGGAT AGAGGTAGCT AAGCTTTTGA TTGAAAACGG TGCTAAAGTT
AATGAAGTTA GTAAGAATGG AGAAATTCCT TTACATTCTG TTGCTGAGAA GGGACAGTTA
GAGTTAGTTG ATCTATTGGC GGAACAAAAA TCCAATTTTA ATGCTAAAAA TATTACGGGG
AATACGCCTT TGCATTTAGC TGTTATAAAT AATCATGTGG AAGTAGTGCG TCTACTACTG
CAATTAGGAG CTAAGTGGAA TGTTGAAAAT AAATCAGGCC GTACACCTCT TCAGTTTGCT
ATACGAAAGG GCTATACAGC AATAGCCGAT TTAATAATTA GCAAAGAGAA AGGATATATG
AGCGAGGAAG AAGATACTTA TAATGAACTT TAA
 
Protein sequence
MIKRKYPVGC QLIIYILLLV SLCLQSCSHS TNLPCASVKE ESVVQTQETM HQAGISSILN 
KEFSAQAGHI LTFYKEAGQL QAVVKENCPI GFSKTHILPV YIEQGAELSD LLRLGEQAQQ
RRIHIQLAQG HQPAKVVIYK GAGLMGGGKK KKQLTDEQKA IQKEKRARKE EEKAKMCKEE
RSRKREEEVK LKKEKARLRK EERARKRKEE NVRREAEKVA LKNTPLHEAI LEGNATKLYE
LVHSGADIYA KGRYGTTPLQ LAVRKSDVEL ISLLLDQRAD INKDKISKLL YLAIRRSDVE
VVNLLLEYGA DINSREHNGI SPLHVAVDEN RTEVVKLLLE QGVDLNVRNN YQNTPLHWAI
RKGYIGVAKL LVQYGADINA QGEYGASPLH IAVAESQMEL VKLFLEQGAD IYVKGKYNDL
VLHWAAARGN VHITKLLLEH EAYICAKLNW DIFQRSVESV HSLVEREFAI NSKDDSGDTP
LHKAARNGHL EVVEILLDQG ANANATNIKG LTPYQVTKEV SILTVLGALS ITDVNVLDKR
VEKENVSNEL KEAHKGGVDV TSASANSLAK DKYDDTDIIR KNKSLQKAIV RGDVKRVSKL
INIGLDINAK NIDGNTLLYL AAQNSWIEVA KLLIENGAKV NEVSKNGEIP LHSVAEKGQL
ELVDLLAEQK SNFNAKNITG NTPLHLAVIN NHVEVVRLLL QLGAKWNVEN KSGRTPLQFA
IRKGYTAIAD LIISKEKGYM SEEEDTYNEL