Gene Aasi_0034 details

Gene Information

Locus tag: Aasi_0034
Symbol:
ID: 6376438
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 44884
End bp: 47175
Gene Length: 2292 bp
Protein Length: 763 aa
Translation table: 11
GC content: 37%
IMG OID: 642681232
Product: Sel1 domain-containing protein
Protein accession: YP_001957218
Protein GI: 189501501
COG category: [R] General function prediction only
COG ID: [COG0790] FOG: TPR repeat, SEL1 subfamily
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
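
The start, end, gene length, and protein length values listed above are mutually consistent if the coordinates are read as 1-based and inclusive and the final codon is a stop codon. The following is a minimal sketch of that arithmetic in Python; the function names are hypothetical and the coordinate convention is an assumption inferred from the numbers, not something the record states.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(44884, 47175)
print(length, protein_length_aa(length))  # 2292 763
```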


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAACAA TTATCACACT CAGTAACCAT AAAGGAAAAG GCGTAAAAAC GACCTTTGCT 
ATAAACCATT CAAAATTAAT ATTAAACAGC CTAATTTTTT GTGCGCTAGG ATTAAGTATA
ATTTCCTGTT CCACGGGGTT AGTAAATACC AACCAGCAAA GTTTTCCCTT AATTATGCAT
GTAAAGCCAG AAGGATTATT AGAGGCAGAA ACGACGATAG AAGCTCATTT CAGTCTAGCC
AACAATGCTG AAATAGCGAA TTTAAAAGAT TGTCAATTAA AAGTTAGTAT CACACCCAAA
GGGAATAGCC ATATCAAGTT GCTATATACT AATGCAGCAG GCAAACAAAA AAGTGTGCCA
AATATTGCTA ACAACCTTAC TTTCTTTACA GATAAAACTG GGGTGGACAT AGAAGATGAG
GTTTTAGTAA TACCTTTTAA GCTTATTCCA GCCGTAGGGG TAGATACAGT GCAAATTACA
TTTGAACTTT TGGATTACGA AGGAAGACGC ATACAAACAT GTCATACAAC CTGGTATAAT
AATCAATCTG CCGGCTTAGG AATGACATCA AGCAAGCAAG TAGATTCCTT AGAAATAGAG
GAAAAAGAAG GCCAAGTATC CATCATAGCA GGAGAACAGG CATCAAGTAC CTTCATAACA
GACCAACAAT TACAGTCAGT TAGTGATTAC CCTACACTTA CAGAAGAAAT AAGCAGCAAA
GAAGAAGTAG TGGAAGGACA ACAGCAAGTA GCTCCTATTA CTATAAATTA TAATTCCACG
CGTATCTTCA AGTTAGCTAA ACTTGCTAAT GCTAATAATA GAGAAGCTCA AGAAACCATT
GTTGGAGGGT ATTTAAGAGT AGGTGTAAGA CCTTATCTAA AAGATTTTAT CAGTCCTTTT
AATTGGGCAG GAATAAAAGA TAAGGCATTA GAAGATGAGC GATATGTTTA TCTTTTGTTA
CGTTTTTTAG GGGAAGAAAA ACAGGGTTGT ATAGATACAG ATATTATTAG TAACATTCGC
AGGCATGCAA AAATGGGGAA TGTCTTGGCC CAAAATAATT TAGGATATAT GTATAGGAAT
GGAGTAGAAT TTCCCCTTGA CTATACAAAA GCAATCAAAT GGTATACCAG GGCAGCCAAA
GCAGGCAATG TACTGGCACA AACTAATTTG GGTTATATGT ATGACAAAGG ATTAGGGGTA
GCACCTAATT CCAAACAAGC GAATAAATGG TACAAGAGAG CAGCTAAACA AGGCTACGCG
GCTGCACAAA CCAATTTAGG ACTCTCGTAT CAGAAAGAAT TAGGAGTAGC TCAAGACTAC
AGGAAAGCTT TCAAGTGGTG TATGAAAGCA GCTGAACAGG CTTATGGAGA TGCACAAGCT
AATTTAGGGA TTATATATCG CGATGGTTTG GGAATTGAGA AAAACTATGA ACAAGCGTTA
ATGTGGTATA CCAGGGCAGC TAGCCTAGAA AATAGAGTTG CACAAGCTCA TTTAGCATGC
ATGCATATGA GAGGTTGTGG AACACCTATA GACCATGATA AAGGCATTTA TTGGCTTATG
AAAAGTGAAA ACCAAAAAAA TATGTTAGCC GCTCGTTCAT ACCTTCCTCA AGATTCTTCC
ACCAATACAG CCTCTCAAGA AGAGCCAAAT GGAATAGGGG AAGAGCTCCT TAAAACATGG
CAAGCAATGA TTATACAAAA AGAGCAAGGC CGTCCCCATA CCCATGCTAT CTTACCACTC
GAAGCTTATC GACAGTTAGA AATAATAATG AATAAATTTA CCCATTGGGC ACATAAAATA
AGTAGCAAAT CTGGCTTAAT GATTAGGTGC GATAATATTA AAGGACAGGA TATAGTGAGT
ACCATTGAAA ATTACCAGAG ATTATCCGGC ACTACGCCCT TTGTAGAGTC ATATATAGTG
CAAGGTAAAA CTTACATTAG TTTTGGCCAG GATAATGTCC AACTATCCAA AGAAATTGTA
AACGAGTTAA TTCAGCGAAT GAACTACAAG CAGGCTAAAG GCATATTAAA ACAGCTTCAA
TTAATTTATA AACAAGCCCA GGCAGAATCA GCCGGTAAAG CAAGTTATCT GGAAGAGCAA
CTAAATTTCT TGCCATTAGT CGAAAAAAGA AATAAACTTT TAAAGAGACT AGAAGAAGCA
AATAAAATAG GAGGGCTGTT TACTTTTAAA CTACAGGATA TAGAAGAAGA GGCTGATCAG
TTTAAGGCTT ATTATAAGCT ACTAATGGAA GAAATTAAAA AAGGAGAAGA TAGACGTAAG
TGGAAATTCT AG
 
Protein sequence
MATIITLSNH KGKGVKTTFA INHSKLILNS LIFCALGLSI ISCSTGLVNT NQQSFPLIMH 
VKPEGLLEAE TTIEAHFSLA NNAEIANLKD CQLKVSITPK GNSHIKLLYT NAAGKQKSVP
NIANNLTFFT DKTGVDIEDE VLVIPFKLIP AVGVDTVQIT FELLDYEGRR IQTCHTTWYN
NQSAGLGMTS SKQVDSLEIE EKEGQVSIIA GEQASSTFIT DQQLQSVSDY PTLTEEISSK
EEVVEGQQQV APITINYNST RIFKLAKLAN ANNREAQETI VGGYLRVGVR PYLKDFISPF
NWAGIKDKAL EDERYVYLLL RFLGEEKQGC IDTDIISNIR RHAKMGNVLA QNNLGYMYRN
GVEFPLDYTK AIKWYTRAAK AGNVLAQTNL GYMYDKGLGV APNSKQANKW YKRAAKQGYA
AAQTNLGLSY QKELGVAQDY RKAFKWCMKA AEQAYGDAQA NLGIIYRDGL GIEKNYEQAL
MWYTRAASLE NRVAQAHLAC MHMRGCGTPI DHDKGIYWLM KSENQKNMLA ARSYLPQDSS
TNTASQEEPN GIGEELLKTW QAMIIQKEQG RPHTHAILPL EAYRQLEIIM NKFTHWAHKI
SSKSGLMIRC DNIKGQDIVS TIENYQRLSG TTPFVESYIV QGKTYISFGQ DNVQLSKEIV
NELIQRMNYK QAKGILKQLQ LIYKQAQAES AGKASYLEEQ LNFLPLVEKR NKLLKRLEEA
NKIGGLFTFK LQDIEEEADQ FKAYYKLLME EIKKGEDRRK WKF
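
Both sequences are printed in space-separated blocks of ten characters, so the whitespace has to be stripped before they are used elsewhere. The sketch below shows one way to do that in Python and to compute a GC percentage from the cleaned string; the helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (whitespace ignored)."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGGCAACAA TTATCACACT CAGTAACCAT AAAGGAAAAG GCGTAAAAAC GACCTTTGCT"
seq = clean_sequence(first_line)
print(len(seq), seq[:3])  # 60 ATG
```

Passing the full gene sequence to gc_percent gives a value that can be compared with the GC content field in the record.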