Gene Aasi_0044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0044 
Symbol 
ID6376717 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp60944 
End bp62335 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content35% 
IMG OID642681241 
Producthypothetical protein 
Protein accessionYP_001957227 
Protein GI189501510 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACATA CTTACTCCTT ACATATAATA GTACGTATTT TAGTGCTATG CTTGTTTTTA 
CAAAACTGCT CAGGCTTTTC CAATGCACCA TTAAATAGCG AAAAAGAGTT TAACATACAG
GATTTACTAG ATCAAGAATT TACAGCAGAT GGGGGGCATT TAGTTTCTTT TTACGAGGGG
CAAGAAGAAA TTAAGGCAAC TGTACAAGTA AACCCCCTTG ATGAAAAAGA TAAAATTTAT
AATGAAGTAA ATGTAGTTGT AGAAAAAGGG GTAGAGCTAG CGAGCTTAGC AAAGCTAGAT
AGAAAAACAC AGCAAAAGCG TATACAAATT CAGTTCTCTA AAGAGCAAAA AGGCAAACCT
CAAAGTGTCG TAATACATAA ACCTTGGTTG ATGGGTGGGA TGAAGGAGGT TATTATATTC
TGTGGAAACC CAGGAGTTGG AAAAAGCTCT TTATGTAACT CTATTTTTCA AAGTTCAAAG
CCAATATTTA ACTCCGGGGT ATCTATTTTA ACAGGAATGA CAACCAATAA ACAGCAATAT
CTGCATGAAG GAAAGCTATA TGTCGACACA CCAGGTCTAG CAGATCCGGA AACTCGTACG
AAAGCTGGCA AAGCAATAAC AGAGGCATTA AAACATAATG GCAATTACAA AATAGTCTTT
GTTATAACTT TAGAGGGTGG AAGGCTAAGC CCTGAAGATG TGGCTACCAT TGAAACAGTC
TGCGAAGCAA TTAAGGTTCC TTTTGAATAT GGTTTAATTT TCAACAAGGT TACTCCAGGA
ATTAGGAAAA AAATAATAGG TATAGGAGTA GAATCATACG TAAAGAAGTA TAGTATAAGC
TTAGATAATA ATATTAATAA CCTCACTGAA GAGTTCATCT TGAATCTTAT ACAACTTGGC
TTATCAGAGG GTTACTTTAA AGCATTTACT AAACAACCAT CATCGGCAAC TATGCTTATG
AGGGAAAGTC ACATGGAAGA CGAGGAAGGT GAGTATTTTA GTGCTAATAG TCCAAATATG
AAGAATTTAT TAAATTTCCT TGGCAAGCTA AAGGCTACTG AAATACATGA ATCCAATATT
ATTCCACTAG ATACTACGGA TTATAAGAAA AAGATCGAAG AACAAGAAGC AAAAAATAAG
AAGCTAGAAG AAGAGCTTAA CAAAGTTAAA GAAGAAAATA GGAGACAAAT AAGAGATCTG
GATGCACAAA TTAATAAATT AAACGAAGAA TTAGCTAAAA AAGGGGAAGG CTTTTGGAGT
AAAGTTGGAA ATTTCTTAGG TGACGTTGGA ATTGCTATTG GAAGTGCTAT TGTGGGTGGT
ATTGTAAAAA GTATTTTTCA TAGACCAGGC CCTACATGTA ATCCGGATCC TGGGGGGCAC
ACAGAATTGT AA
 
Protein sequence
MKHTYSLHII VRILVLCLFL QNCSGFSNAP LNSEKEFNIQ DLLDQEFTAD GGHLVSFYEG 
QEEIKATVQV NPLDEKDKIY NEVNVVVEKG VELASLAKLD RKTQQKRIQI QFSKEQKGKP
QSVVIHKPWL MGGMKEVIIF CGNPGVGKSS LCNSIFQSSK PIFNSGVSIL TGMTTNKQQY
LHEGKLYVDT PGLADPETRT KAGKAITEAL KHNGNYKIVF VITLEGGRLS PEDVATIETV
CEAIKVPFEY GLIFNKVTPG IRKKIIGIGV ESYVKKYSIS LDNNINNLTE EFILNLIQLG
LSEGYFKAFT KQPSSATMLM RESHMEDEEG EYFSANSPNM KNLLNFLGKL KATEIHESNI
IPLDTTDYKK KIEEQEAKNK KLEEELNKVK EENRRQIRDL DAQINKLNEE LAKKGEGFWS
KVGNFLGDVG IAIGSAIVGG IVKSIFHRPG PTCNPDPGGH TEL