Gene Aasi_1484 details


Gene Information

Locus tag: Aasi_1484
Symbol:
ID: 6376500
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 142254
End bp: 144626
Gene Length: 2373 bp
Protein Length: 790 aa
Translation table: 11
GC content: 34%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003572981
Protein GI: 294661106
COG category:
COG ID:
TIGRFAM ID:
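
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by a CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not contribute an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(142254, 144626)
print(length)                     # 2373
print(protein_length_aa(length))  # 790
```

Both values agree with the "Gene Length" and "Protein Length" fields of the record.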


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTCCCTAT TACTACTTTT CAGTAATGTT TCTTGTAAAT GTGGCAATTT TAAGCAAGGT 
AAACCAGCTA AACAAGGTAG GCCCATAAAT AAGAAAGGGA GTCTTGCCCT AGAAAAAAAT
AGTCTTAGTA TTAAAGCTTA TCCAGATAAA TTAATAGGAG ATTCTAAAAA GACAAAACTT
GCCATACAAC TAATAGATAT TAGTAAAGAA GTTCAATTAG ATGAAATTAT ACTAAAAAGT
ACACTTATAC ATCAAGATGG AAATGGTAGC CAGCTAAACT ATACTGATGC TGCAGGGAGA
ATACATAAAA TGAGCAACCT AGCAAGTCAA TTAGCAGTAT TTAATAAAGG TACTATATTG
CATGCTAAAG CTCGGTTGTT AGAAGTAGAA GTGGAAATAC TGCCTGGACC AGCTGTTAAA
GAAGTAACAT ACCAATTTGA ATTATTAAAT AATGTAGGCA AGTACATAAA TAGTTGTGAA
GCAAACTGGA AAGAGCAGGA AGCTATAATT CAAGATGTAA TTTATGATCA GACTACCCAA
GAACTGGTAT GTATCTTTAA AAATATAGGT TTAAAAGCAC TAGAAAACAT ACAATTAAAT
TATACTAGTC AAACAGATGA ACTTAAATTA GGCGAAGAAA TTTTAGACAA GGGTACAACA
AAGACAAAGA ATATAGCTGC TCTGCCTATA GGCACTATGG CATTATCCTT AGGTAAATTA
AAATTAGACA CTCAACAACT TGCAAAGATT GAGGTCTCCC TTATATCATC AGAAAATAAG
ATTTTGTTTC AGCCTAAGAC TTATGCTTTC GTTAACCCTG GCATAAAATT AGATTTTAAA
AATTTATATT ATAATGCTTC AGCAAAGAGT ATTGTTTATC AGGTGTCTAA TTTAGGGACG
TTGCCTGGAC ATAAAATTCA GGTTAAGTAT AAGAATATAA GCAGATCTAT AGAGGAGCAA
CAAGTGAACT TAAATGGAGA GAAAGAGCAA ACAGTAGATA TAGAATATTT GGATACAGGT
ACACATACAG CCTTTTATGA GTTACCGATC AATTTCAAAG GGCAAAAACA TGCTGTGTTT
TCATTTAATA TATTATATGC AGGTGTGTCT ATGGTGCATA AAGAGTTGAT ATGTGAAAAT
GAATTGGCTG ATAATGCTAT ATACCAAGCA ATAGAGAAAG AGAATTTTGA TGAGGTATTA
AAGCTTATTG AAAATGCTCC GTTTGATGCG ATCAACTATC AAGATCCTGT TACTAAAGAT
ACTCCTTTAT TATTAGCAGC ACAGTTAGGC CACCAAAGCA TAATAGAAGC ACTACTTAAG
GCAGGAGTAA ACGTAAATAC TCAGAATAAG TACGGAACCA CAGCATTGCT ACGTGCAGCT
ATAGATGGGA AAAAAGATAT TGTAGAGGCA CTAATTAAAA GTGGGGTTGA CTTGGATACT
CAGCATGGAG GTAAGGCATT ACTTCATGCA GTATATAGTG GATACAAAGA TATAGTAAAA
GCTTTGCTTG ATGCAGGAGT AAATGTAAAT ACTCAAGGTG GTGACGGAAG AACAGCATTG
ATGGAAGCAG TATCAAAGTC TTGGAATTTA GAGGGGGAAG AGATAATAGC GCTTTTACTT
AACAATGGGG CTAATATTAA TGTGCAAGAT CAGGAAGGGA ACACAGCTTT GATGTATGCT
ACTTTTGGAA AGGACCAAGC AATAGTAGAG GCATTGCTCA ACAGAGGGGC AAGAATAGAC
CTGAAAAATA AGTTTGGGGA AACAGCGTTA TGGGGAGCAT TTGATAAAGA AAATATATCA
ATGTTAAAGT TATTGTTCAA AAGGTTAGAC AAGCATATTC AAATTAACCA GTTAGGTAAA
TTTATACAAC ATGCACTTCC ATGGGCAATT AGAAGGGGAG CTAAAGAAAT AGTAGAAGAA
TTACTAAATA GAGGTGCTGA GCTTAACAGA TCAGATGAAT GGAACACTAA TCTTATAGAA
GCTATCATAA ACAAACAACC AGAGATTATA AAATTATTGC TTGAAAAAGG TGCTAAAGTG
GATGGCCAGA ATAATAAAGG TGAGACAGCT TTAATGTGCG CAGTTCAGAA AGGAGATACA
GATACAGTCT CTACGTTATT AGAGCAAGGA GCTGATGTTA ACAAAAGAGA TTTGCAAGGC
TTTACTGCAT TAATGTATAT AGTTAAGCGT ATGCATGGAA TTGAAAGCCT AGCGAAGAAA
TTAACGAACG AAGATATATT AGAAAAATTA CTAGAAGCAG ATGCAGATGT TAATATAAGT
AATGGTGATG GAGAAACAGC TTTGGATTTA GCTAATGATA AACCAGAAAT TAAGAATCTA
TTAGAGAGTC ATTTAGCTAG AGCGAAATTT TAA
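
The gene sequence above is printed in blocks of 10 bases, 60 per line, so the whitespace has to be stripped before it can be measured. As a minimal sketch, the helpers below (hypothetical names, not part of the source database) clean such a block and compute its length and GC percentage. Pasting the full sequence into `raw` would let the result be compared with the reported 2373 bp and 34% GC; only the first line is used here.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied as displayed (6 blocks of 10 bases).
raw = "TTGTCCCTAT TACTACTTTT CAGTAATGTT TCTTGTAAAT GTGGCAATTT TAAGCAAGGT"
seq = clean_sequence(raw)
print(len(seq))                  # 60
print(round(gc_percent(seq), 1)) # GC% of this 60-base fragment only
```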
 
Protein sequence
MSLLLLFSNV SCKCGNFKQG KPAKQGRPIN KKGSLALEKN SLSIKAYPDK LIGDSKKTKL 
AIQLIDISKE VQLDEIILKS TLIHQDGNGS QLNYTDAAGR IHKMSNLASQ LAVFNKGTIL
HAKARLLEVE VEILPGPAVK EVTYQFELLN NVGKYINSCE ANWKEQEAII QDVIYDQTTQ
ELVCIFKNIG LKALENIQLN YTSQTDELKL GEEILDKGTT KTKNIAALPI GTMALSLGKL
KLDTQQLAKI EVSLISSENK ILFQPKTYAF VNPGIKLDFK NLYYNASAKS IVYQVSNLGT
LPGHKIQVKY KNISRSIEEQ QVNLNGEKEQ TVDIEYLDTG THTAFYELPI NFKGQKHAVF
SFNILYAGVS MVHKELICEN ELADNAIYQA IEKENFDEVL KLIENAPFDA INYQDPVTKD
TPLLLAAQLG HQSIIEALLK AGVNVNTQNK YGTTALLRAA IDGKKDIVEA LIKSGVDLDT
QHGGKALLHA VYSGYKDIVK ALLDAGVNVN TQGGDGRTAL MEAVSKSWNL EGEEIIALLL
NNGANINVQD QEGNTALMYA TFGKDQAIVE ALLNRGARID LKNKFGETAL WGAFDKENIS
MLKLLFKRLD KHIQINQLGK FIQHALPWAI RRGAKEIVEE LLNRGAELNR SDEWNTNLIE
AIINKQPEII KLLLEKGAKV DGQNNKGETA LMCAVQKGDT DTVSTLLEQG ADVNKRDLQG
FTALMYIVKR MHGIESLAKK LTNEDILEKL LEADADVNIS NGDGETALDL ANDKPEIKNL
LESHLARAKF
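
The protein sequence can be reproduced from the gene sequence using translation table 11 (the bacterial, archaeal and plant plastid code). The gene begins with TTG, which is an alternative start codon in that table and is therefore read as methionine rather than leucine. The sketch below is a hedged illustration with hypothetical function names: it builds the codon table from the compact NCBI-style amino-acid string and treats a first codon from the table 11 start set as "M".

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (third base varies fastest), table 11.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(raw: str) -> str:
    """Translate a DNA fragment in frame 1, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # any valid start codon initiates with Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as displayed in the record.
print(translate_cds("TTGTCCCTAT TACTACTTTT CAGTAATGTT"))  # MSLLLLFSNV
```

The output matches the first ten residues of the protein sequence. Note that this sketch applies the start-codon rule to whatever codon comes first, so it is only appropriate for fragments that begin at the true start of the CDS or whose first codon is not in the start set.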