Gene Aasi_1439 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1439 
Symbol 
ID6377499 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1852982 
End bp1856005 
Gene Length3024 bp 
Protein Length1007 aa 
Translation table11 
GC content36% 
IMG OID642682509 
Producthypothetical protein 
Protein accessionYP_001958458 
Protein GI189502741 
COG category[G] Carbohydrate transport and metabolism
[V] Defense mechanisms 
COG ID[COG1472] Beta-glucosidase-related glycosidases
[COG1680] Beta-lactamase class C and other penicillin binding proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00938781 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACGCT TCGCTATACC CTTTATAACT GCTATATTGT TTTTCTCCCT TACCTTTATT 
CCTACCCAAT CAATTCCATC AGATAAAGCA GGTTGGATAG AATATCAGTT TCAACGGCTC
ACTTTGGAAG AACGTATTGG ACAGCTTTTT ATGGTAGCTG CTTATTCTAA CCAAGGCGAA
AAGCATCATG AATTTATAGA AAACTTAATA CAACGATATA ATATTGGAGG ACTTATTTTT
TTTCAAGGTG ATCCTATCAG CCAAGCAAAA CTTACCAATC AATACCAACT AAAAGCAAAA
ACACCTTTAC TTTTAGCTAT AGATGCAGAA TGGGGGCTTG GTATGCGTCT TACTAACACA
ATAAGCTACC CTAGGCAGAT GACCTTAGGT GCTATACAAG ATCATCAGCT TATTTATGAT
ATGGGTGCTG AAATTGCACG CCAGCTTAAA CTACTAGGTA TACATGTTAA CTTTGCTCCT
GTAATTGATA TCAATAACAA TCCAGACAAC CCAGGTATAG GTAATAGAGC TTTTGGAGAT
GGTAAGGGGA GTGTAATTAG TAAAGGACTA GCCTATATTC AAGGATTACA AGATAACGGT
ATATTAGCAG TAGCTAAACA TTTTCCTGGT ATAGGAGATG CTAGTAAAGA TCCGCATCAT
GAACTGCCTA CTATTCCATA TGATATTACT CGGTTAGAAT CTATAGAGCT TTATCCATTT
AGAAAAGCAA TACGTGCTAA TGTAGGAGGT ATTATGGTTT CTCATATCTA TTTACCAGCT
TATGAAAAAA CACCTAACCG AGCTGCATCT CTCTCATCTC ATATTGTGAC CCAGCTCTTA
AAAAACAAAT TGGGTTTTAA AGGACTTATT TTTACAGATG CGCTTAATAT GAAGGCTGTT
AGTAAGTACT ATCAGCCTGG TGAAGTAGAC TTACTAGCTT TGCAAGCAGG GAATGATATT
TTACTTTTCC CGGAAGATGT GCCTAAAGCT ATTGCACTCA TTAAATCTGC TATTGAGCAA
GGTAAATTAG CTAAGGAAGT AGTAGAAGAA AAGGTTAAGA AAATTCTAGC AGTCAAATAC
CAGATGGACT TACATCAGTG GAAGTCTATA GAAATAGATG GGTTGTACGA GCAACTCAAT
ACACCTCAGG CTCAAGTGCT AAAGCAGAAA CTATTTGAAC AAGCCATTAC ATTAGTAGCC
AACCAAGACG ATTTGATTCC CATTACCAAA TTAAATAAAC ATAAGATTGC TTCACTTTCT
ATTATTAAGC AGCCTATTAC TGCAGAAGCA CAGAAATCAA TCAACCAACA GAATACAGTT
GCTACTAATA AGCCATCTAC TATTTTTGGT CAGTTTTTAT CGCAATATGC TCCTGTTGCT
CACTATACAC TCAATAGGAC ATCACTAGAC GTTAATATGC TACAGCAACT GGCAGATGAG
CTAGAGAATT ATTCGTTGGT TATCGTAGGC TTACACGACT TAGCTGGTAA CAGAGCTAAT
AAATTTGGTT TACAGCCAGA ATTATTAAGC TTTTTAACTA AACTACAGCA CGCCAATACA
AAAGTTTTAA TAGTGGTTTT TGGAAGTGTT TATAGTTTAG AATTATTCCA AAACATGCAA
CATCTTATAG CAGCTTACCA AGATGATCCT ATAGCAGAAC AAGTGGTGCC TCAGATTATT
TTTGGAGCCT TGCCAGCTGT GGGTAATTTG CCTGTTAGTA TACCTAATGC TTGGAAGTCG
GAATGGGGTA TTAGAACAAA AAGTATAAAA AGGCTTGGTT ATGCTTTGCC AGAAGCTGTG
CAAATGGATA GTCGCATATT ACAAGGTATT GATAAAATTG TAGAGGATGC AATTTTAGAA
GAGGTGATGC CAGGCTGCCA AGTGTTGATA GCTAGAAATG GGAAAATAGT TTTTGAGAAA
GCTTATGGTT ATCATACGTA CGCAAAAAAG AATCCCGTTA CTAATACAAC ACTTTATGAT
ATTGCTTCCA TAACCAAGGT AGTAGGGCCT TTGCAGGCAA TCATGTATTT AGTAAGTCAG
AATAAGTTAG ATATAACACA AAAAGTTTCT ACCTATTTAC CAGAACTGTC TGCCACCAAT
AAAAAAAATA TAACTATCAA GTCAATTTTG GCTCACCAAG CTGGATTACT AGATTATGGT
ATAACAAGAA GCATTTTGTT TCAAAAAGAT AGTAAATTAA GCAAGAAACT GTTTAGCAAC
TATCCATCGG CAAGCTATCC CAATAGAATA GGTACGGAGC TATATGCCCC TCATTTATTA
AAAGAGCTTA TGTGGGATCT ATATATCAAC TCTCCAATAA AAGAAAAAGA TAAAACAAAA
AAGCCATCTA AAACACATGG TTACCATTAT AATGACTTAA GCTTCCATAT TATGCATAGG
CTAATAGAGA AATTGTTACA ACAGCCTATG GAGATTTTTC TTGCTAATAA ATTCTATCAA
TCTTTAGGAG CTGCTCTAGT TGGTTATAAC CCATTAGAAA GAATTAGTTT ACAGCAAATA
GCGCCTACAG CAGAGTGTGA CTTTTTTAGA ACTACCCCCA TTCATGGTAT TGTTCATGAC
CCACAAGCTG CAATCTGTGG GGGTGTAGCA GGAAACGCTG GGCTCTTTAG CAATGCTCAT
GATTTGGCTG TTATTCTGCA AATGAATTTG CAAGGTGGCT ATTATGGAGG AAAAAGATAT
TTAAAAAAGA AAGTTATAAA ACAGTTTACT AGCCACGCTT TTAAAAATAA TCGGCGGGGA
CTTGGATGGG ATAAACCAGA ATTGCCTACC AAGCTAGACA GCAAGCATAA GTCTAACACA
TCTTTATATG CTTCTGCTGA TACTTATGGG CATTTGGGCT TTACAGGCAC GGCTGCTTGG
GTAGATCCAA AGTATAATCT TGTATATATT ATACTATCTA ATAGAACCTA TCCTACCCAA
GAAAATAACA AGCTGGCTGA GCAAAATATA CGTATTAAGC TACAAGATAT TGTTTACCAA
GCTTTGCAGA ATATGGAACA ATAA
 
Protein sequence
MKRFAIPFIT AILFFSLTFI PTQSIPSDKA GWIEYQFQRL TLEERIGQLF MVAAYSNQGE 
KHHEFIENLI QRYNIGGLIF FQGDPISQAK LTNQYQLKAK TPLLLAIDAE WGLGMRLTNT
ISYPRQMTLG AIQDHQLIYD MGAEIARQLK LLGIHVNFAP VIDINNNPDN PGIGNRAFGD
GKGSVISKGL AYIQGLQDNG ILAVAKHFPG IGDASKDPHH ELPTIPYDIT RLESIELYPF
RKAIRANVGG IMVSHIYLPA YEKTPNRAAS LSSHIVTQLL KNKLGFKGLI FTDALNMKAV
SKYYQPGEVD LLALQAGNDI LLFPEDVPKA IALIKSAIEQ GKLAKEVVEE KVKKILAVKY
QMDLHQWKSI EIDGLYEQLN TPQAQVLKQK LFEQAITLVA NQDDLIPITK LNKHKIASLS
IIKQPITAEA QKSINQQNTV ATNKPSTIFG QFLSQYAPVA HYTLNRTSLD VNMLQQLADE
LENYSLVIVG LHDLAGNRAN KFGLQPELLS FLTKLQHANT KVLIVVFGSV YSLELFQNMQ
HLIAAYQDDP IAEQVVPQII FGALPAVGNL PVSIPNAWKS EWGIRTKSIK RLGYALPEAV
QMDSRILQGI DKIVEDAILE EVMPGCQVLI ARNGKIVFEK AYGYHTYAKK NPVTNTTLYD
IASITKVVGP LQAIMYLVSQ NKLDITQKVS TYLPELSATN KKNITIKSIL AHQAGLLDYG
ITRSILFQKD SKLSKKLFSN YPSASYPNRI GTELYAPHLL KELMWDLYIN SPIKEKDKTK
KPSKTHGYHY NDLSFHIMHR LIEKLLQQPM EIFLANKFYQ SLGAALVGYN PLERISLQQI
APTAECDFFR TTPIHGIVHD PQAAICGGVA GNAGLFSNAH DLAVILQMNL QGGYYGGKRY
LKKKVIKQFT SHAFKNNRRG LGWDKPELPT KLDSKHKSNT SLYASADTYG HLGFTGTAAW
VDPKYNLVYI ILSNRTYPTQ ENNKLAEQNI RIKLQDIVYQ ALQNMEQ