Gene Aasi_1501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1501 
Symbol 
ID6376524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp224938 
End bp228195 
Gene Length3258 bp 
Protein Length1085 aa 
Translation table11 
GC content36% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003572993 
Protein GI294661118 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGGGC GGCGTGTAAG AAGTTTACGA GATTTCTCTA TAGGGAATAA AGATTTTACT 
ACAGCCACAG TAACTTCCAC CATTGTAGCT ACCTGGATTG GAGGTGGATT TATGTTTTAT
GGCCTACAGA ATATTTATAA GGATGGCCTA CAATTTGTTA TACCCCTTTT AGGATCAACT
TTATGTTTAC TTTTTACCGG ACAAGTCCTC GCTATTAGAA TGGGAGAATT CTTAAATAAC
TTGTCAGTAG CTGAAGCCAT GGGGGATCTC TATGGGCCCA TAGTACGAGT TATTACTGCT
ATTAGTGGGA TATTAAAGTC CATAGGTTCA ATAGCTATTC AATTTCAGGT GATTGCTAAA
ATGTTGAATC TTCTTTTTGG CATTGAAGAG ACTTGGGCTA TTATTGCAGC TGCGTCCATT
GTTATTCTTT ATTCAGCTTT TGGCGGTATC CGTTCAGTTA TTATTACGGA CTTATTTCAG
TTTATTGCCT TTGTTATTTT CATTCCTATA TTAGGCCTGA TAGTTTGGAA TCATATTAAA
AATCCTGGTC AAGTTCTTTC TACTCTAACT ACCAACCCTA TTTTTAGTTT AGAACAAACA
ATAGGATGGA ACCCCAAATT TATAAGTTCA ATAGGGCTAA TGCTTTATTT TTTAATTCCT
GGCATGGCCC CTGCTATCTT TCAGCGGGTA ACAATTGCTA GAGATCTTGA ACAAGTAAAA
AGATCTTTTA CCTACGCAGC TGGAAGCACT ATCCTAGTAA ATATTGCAGT AGCTTGGGTT
GCTATTTTAC TTTTATCAGA TAGCCCTAAT CTTGAACCGA GTAAACTAGT TAATTATATT
ATTACAACGT ATGCTTATCC AGGATTAAAA GGGCTTATTG CTGTTGGTAT TACAGCCATG
GCTATGTCTA CAGCAGATTC CTATTTAAAT TCCTCTGCAG TATTAGCCGT TAATGATATT
ATTAAGCTAT TTAAACCTTC TTGGAAAGAA TCTATTATTG TTATTAGATC TTTCTCATTG
ATTTTAGGAG TTTTCGGATT ATTACTAGCC CTCCATGCCA AAGATCTGCT TCAGCTGCTG
CTCCTATCTG GCAGCTTTTA TATGCCTATT GTTACTGTAC CACTTCTACT AGCTATTTTT
GGTTTTAGAA GTACTACTAG AGCAGTCCTA ATAGGAATGG CAGCAGGCTT TATAACAGTA
GTAGGCTGGA AAAAGTTTTT TGGATATACA GGTATGGATA GCCTTATTCC TGGTATAATA
GCTAACTTAG TTGTCTATAT AAGTAGCCAT TATATTTTAA GAGAACAAGG AGGATGGGTA
GGGATTAAGG AAAAAGGTCC GCTTTTAGCA GCCAGACAAA GCCGTAGAGA AGCATGGAAC
AAGTTACTAT ATACCATTAA GCATCCCCAT ATTTATGCTT ACTTACAAAA AAACCTCCCG
GCTTATGAAG TTGTTTATTC GCTCTTTGCC GTTTATGTGA TAGGTGCTAC CTATGCTTCT
TTTTATACCA TATCAGAAAC AGTAGTTTCT TCTTATCAAG GGCTTTATAA CTTTGCAGCG
CATTCTGTTT TAATTGCAAC AGCTGGGTTC TTAACATACC CTGCTTGGCC TCCCACTTTT
AAATCTAAAA AGTTTATCAC CTTTGCTTGG CCTCTTGGGA TATTCTATAT TTTATTTGTT
GTGGGCACTA TTTTAGTGCT GATGAGCGGT TTCCACGAAG TACAAGTAAT GATCTTTATG
CTTAACTTAA TCATGGCTGC TTTCCTACTT TCTTGGCCTT TAATGCTCTT CCTTGCTACA
TACGGTATCC TAATAGGGTG TTTAATTGTA TATATGTACT GTGGCAATAT ACATTGTAGT
GGAACAGATG GCACAGCTGA ATTCAAAGTT ATTTATAGCA TTCTTTTATT AAGTAGTTTT
CTAATTACAA TATTTAGGTT TAAGCAAGAA AAAAAGTCCT TGCAAAACAA GAATATTTAC
CTAGAAGGTT TATATGAAGA AAAAAACAAT GAGCTAGCAC AAATTTTAGC TTATGGAGGT
GAGGTTTTAA AGGAACTCAA TGCTGACGAG AAAGCATTAA CGGCGGCTTA TATAGAGCAG
ATTATCTACC GTATGACGGA TTACATCCGA TTGGAAGTAG CTCAGATAAA ATTAGACCAG
CTTTTATTAG AGGTAAAAGA AACCATTAAA CTCATGAACT TATTATCTCT TCCCCAGCTG
ATAACGAGTG TAGATACTCG CCAAGAAATT ATTGATGCAG ATAGAGTAAA ATTAAAACAA
CTGCTGGTAA ATGGTATCCT ACATGTACAC CAACATAATG CAAACAACCA GCCTATTCAT
GTAGTAGTAG AAGACGTTAA GCTAGGCTAT AAGATAGATT ATATTAAAGA TTACACCAGG
CAATTAGCAG CCTTAAGATT TACCATTACC ATAGAAAAAG ACACTGCTAA CAAAAAAGAT
ATTTACTTAC TTGAGCAGCT GCCTTTGATG AGTCAACATA CAAAAAAAGG TAAACTAATA
GAAAATGCTC GTATTATTCA TGCTCATTAT GGATACGCAG AATTGGATAA CGAGCAAACC
CAAGTATATG TACTTCCAGC CAACGTAAGA GAAGTAAGAG GTAAAGTGAT GGAGTTATTA
AGGGAACCTG TAGCAGTCGA TGAAGCGGAA ATGAAACATC CGCTCGCTAT AAAACTAGAA
AATGAGCTAT TAGATAAGAT CAAGGCTATC AAAATAGATA CAAAAACTAT AGCTAAAGCA
TTAAAAACTA TTAAAAGATA CCACGCCGGT GTTAAGCGTA AATCAGGCGA GCCTTTTTTT
ACCCATCCCA TAGCTGTCGC TCTAATCTTA TTAGAATATT GTAAGGATCA AGATGCAGTG
GTAGCAGCGC TACTTCATGA CACAGTTGAA GATACAAGTC TTTCGCTGGT GCAGATTGAA
GCTATATTTG GAGAACAGGT AGCATTTATA GTAAAAAAAG TAACTAACCT AGAAGATAAT
TTACGCAGGA TAAGCTTAGC AGATCATGAG AATGTTTATA GACTCATGGA GTATGAAGAT
GAACGTGCTG CCTACGTAAA ACTAGCAGAT AGGCTACATA ACATGCGCAC CATCAGTGGC
CACTCTTCCC TGGCTAAGCA GAAGCATATA GCAAATGAAA CATTAAATTT CTTTGTAGGA
CTTGCTGAAA AGTTAGGTTT GGAACCTATT GCAAGTGAGC TTAAAAAACT TAGCTTAGAA
GTCTTGGCTA AGAGATAA
 
Protein sequence
MAGRRVRSLR DFSIGNKDFT TATVTSTIVA TWIGGGFMFY GLQNIYKDGL QFVIPLLGST 
LCLLFTGQVL AIRMGEFLNN LSVAEAMGDL YGPIVRVITA ISGILKSIGS IAIQFQVIAK
MLNLLFGIEE TWAIIAAASI VILYSAFGGI RSVIITDLFQ FIAFVIFIPI LGLIVWNHIK
NPGQVLSTLT TNPIFSLEQT IGWNPKFISS IGLMLYFLIP GMAPAIFQRV TIARDLEQVK
RSFTYAAGST ILVNIAVAWV AILLLSDSPN LEPSKLVNYI ITTYAYPGLK GLIAVGITAM
AMSTADSYLN SSAVLAVNDI IKLFKPSWKE SIIVIRSFSL ILGVFGLLLA LHAKDLLQLL
LLSGSFYMPI VTVPLLLAIF GFRSTTRAVL IGMAAGFITV VGWKKFFGYT GMDSLIPGII
ANLVVYISSH YILREQGGWV GIKEKGPLLA ARQSRREAWN KLLYTIKHPH IYAYLQKNLP
AYEVVYSLFA VYVIGATYAS FYTISETVVS SYQGLYNFAA HSVLIATAGF LTYPAWPPTF
KSKKFITFAW PLGIFYILFV VGTILVLMSG FHEVQVMIFM LNLIMAAFLL SWPLMLFLAT
YGILIGCLIV YMYCGNIHCS GTDGTAEFKV IYSILLLSSF LITIFRFKQE KKSLQNKNIY
LEGLYEEKNN ELAQILAYGG EVLKELNADE KALTAAYIEQ IIYRMTDYIR LEVAQIKLDQ
LLLEVKETIK LMNLLSLPQL ITSVDTRQEI IDADRVKLKQ LLVNGILHVH QHNANNQPIH
VVVEDVKLGY KIDYIKDYTR QLAALRFTIT IEKDTANKKD IYLLEQLPLM SQHTKKGKLI
ENARIIHAHY GYAELDNEQT QVYVLPANVR EVRGKVMELL REPVAVDEAE MKHPLAIKLE
NELLDKIKAI KIDTKTIAKA LKTIKRYHAG VKRKSGEPFF THPIAVALIL LEYCKDQDAV
VAALLHDTVE DTSLSLVQIE AIFGEQVAFI VKKVTNLEDN LRRISLADHE NVYRLMEYED
ERAAYVKLAD RLHNMRTISG HSSLAKQKHI ANETLNFFVG LAEKLGLEPI ASELKKLSLE
VLAKR