Gene Aasi_1477 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1477 
Symbol 
ID8999544 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp101889 
End bp103733 
Gene Length1845 bp 
Protein Length614 aa 
Translation table11 
GC content39% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003572975 
Protein GI294661100 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTACTT GCTTATTCTG GTGTAGGTTA AATGCATACT TATTCCTTTT AGTATATAAC 
AAGGATTACA TTTTAACCTT TTCTACTCTC TTTATCAACC TCTTTCGCAG TAGGTCTATT
GAGCAACAAG AAGGCAATAC TTTGCCGACT ATTATGCCGG AATTATGGCA GTACATTTTC
TCTTACCTAG ACTTCGAGGG AGTCCTTGCA GCCAGATCAG TTAGTTCTGA TTGGAATAAG
CTCATCACAG GCTTTAGACA AGCAGGTATA GTAGGCGTTG AGAATAAGCC TTGCCATATT
ATTGATACAA GTGGTTGGGT AAAAGTCAAG GAAGTAGATT TTAAATATAG CAACCTAAAA
AAGGATTTAA CTCCTGCAAC CATCCCTAGT TTTATTTTTT ATCACTTAAT GGGACATGTA
CGTAATTTAG CTCAAAGCTG GTGGCCTTAC TTACAAGGAA ATAATATACA TACGCTTGGT
TTGAGCTGGA ATCATCTAGG TGACCAAGGA GCCATAGTAT TAGCCCAGCA TTTACAAGGT
ACAAACGTGC ACACGGTTGA TATAAGCGAT AATCAAATAG GTGATCAAGG ATTAGAAGGA
TTTGCTAAGC ACCTGCAGGA AACAAACGTT CATATGGTTG TTTTAAGCAG GAGTGAAATA
GGTGATCAAG GAGCAGAAGG ATTCGCGAAG CACTTACAGG GGACGCGCGT GCATGTGGTT
GATTTAAGCA TGAATAAAAT GGGTGATTCA GGCGCAGAGG CATTGGCTAA GCACCTGCAG
GGAACACAAG TGCATACCCT TGATTTAAGT ATGAATAAAA TAGGCGATCA AGGCGCAGAA
GCATTGGCTA AGCATTTACA AGGAACTAAC TTGCAGGAGA TAGATTTGAG ATTTAATCAA
ATAAGTGATA GGGGAGCCAT AGCATTAGCC CAGCATTTAC AAAAAACTAA CATACATACC
CTTAATTTAA AATCTAATAA AATAGGAGCA CAAGGTGCCA TAGCATTAGT TCAGCATTTA
AAAGATGCTA GAATTCATAC GCTTGATTTG TGTGAAAACA AAATAAAGGA TGCAGGAGTT
ATAGCATTAG CTCAGTATAT AAAAGGAACC AGCGTACATA CAATTAACTT AAGCAAGAAC
AAAATAAGTG TATTAGGGGC TATGGAGTTA GCCAAGCATT TACAAGAAAC CAATGTGCAT
ACCCTTAATT TAGGCTATAA CCTGATAAGA GCTGGTCAGC TAGTTAAGCA TTTACAAGGA
ACGCGCGTGC ATACGCTTAA TTTAAATGGA AACAATATAG ATGGCCAAGG AGTCATAGCA
TTAACTCAAT ATTTAGAAGG TACTAACGTG CAGGAGCTTT GTTTAGGCGA CAATCAAATG
GGCGCAGAAG GCGCCATAAG ATTATGCAAG TATTTACAAA GTACTAATCT AGAGAAACTT
GGGCTAGGTA GTAATGAGAT AGGCAATGTA GGAGCCATTG AGTTAGCCAA GCTTTTACAA
GATACTAATG TGCATACGCT TGATTTAAGT ATGAATGAAA TAGACGATGT AGGAGCAATC
GGAATAGTCC AACATTTACA AGGAACGAAG GTGAAGGAGC TTCGTTTAAG CTTTAATGAA
ATAGGTACAG AAGGAGCGAT CGGGATAGCC CAACATTTAC AAGGAACTAA AGTACAGGAG
CTTGATTTAG GTCACAATCG TATAGATGAA GAAGGAGCTA TGCAATTAGG CAAGTATTTA
CAAGGAACAA CTGTTCATAC GCTTGATTTA AGGTCGAACC AAATAGAAGA TGGTACGCAG
CAATTATTGC AGCAACAATA TCCTCATATT AAGTGGTGGT TTTGA
 
Protein sequence
MGTCLFWCRL NAYLFLLVYN KDYILTFSTL FINLFRSRSI EQQEGNTLPT IMPELWQYIF 
SYLDFEGVLA ARSVSSDWNK LITGFRQAGI VGVENKPCHI IDTSGWVKVK EVDFKYSNLK
KDLTPATIPS FIFYHLMGHV RNLAQSWWPY LQGNNIHTLG LSWNHLGDQG AIVLAQHLQG
TNVHTVDISD NQIGDQGLEG FAKHLQETNV HMVVLSRSEI GDQGAEGFAK HLQGTRVHVV
DLSMNKMGDS GAEALAKHLQ GTQVHTLDLS MNKIGDQGAE ALAKHLQGTN LQEIDLRFNQ
ISDRGAIALA QHLQKTNIHT LNLKSNKIGA QGAIALVQHL KDARIHTLDL CENKIKDAGV
IALAQYIKGT SVHTINLSKN KISVLGAMEL AKHLQETNVH TLNLGYNLIR AGQLVKHLQG
TRVHTLNLNG NNIDGQGVIA LTQYLEGTNV QELCLGDNQM GAEGAIRLCK YLQSTNLEKL
GLGSNEIGNV GAIELAKLLQ DTNVHTLDLS MNEIDDVGAI GIVQHLQGTK VKELRLSFNE
IGTEGAIGIA QHLQGTKVQE LDLGHNRIDE EGAMQLGKYL QGTTVHTLDL RSNQIEDGTQ
QLLQQQYPHI KWWF