Gene Aasi_1453 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1453 
Symbol 
ID6377475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1882971 
End bp1884308 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content40% 
IMG OID642682521 
Producthypothetical protein 
Protein accessionYP_001958470 
Protein GI189502753 
COG category[R] General function prediction only 
COG ID[COG2312] Erythromycin esterase homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCCTAA TAAAAAACAT ACAGATTATG CAAAAAGATT GGAATCTAAT GGACACTATC 
CAGAGCAATA AGCATAGCCT ACAGGATGAA AAAGATTTAG ATGCTTTAAT CAAGCATATA
TGCGATGCTA AAATAGTGAT GCTCGGCGAG GCAAGTCATG GTACCCATGA GTATTATGCG
GGCAGAGCTT TAATAAGCAG ACGCTTAATT GAAGAAAAAG GATTTAATTT TATTACTGTT
GAAGGAGATT GGCCCCCTTG CTACCAAATC AATAGATTCA TTAAAAATTA TGTAGGTCGC
CAAAAAGATA TCCAGGAAGT ATTGCAGGTA TTTGATCGTT GGCCTACTTG GATGTGGGCT
AATTGGGAAA TTGCTGCATT TAGTTGGTGG TTACATCAAT ATAATTTAAA GCATGTTCCA
CTTAATCGTA TAGGCTTTTA TGGACTAGAT GTCTATAGCC TGTGGGAATC CCTTCAGGCA
ATTGTAAATT ATTTAGAAAA AGAAGATCCT GAAGCAGCTA GTGTGGCTAA ACAAGCTATA
CGTTGTTTCG AACCTTACTG GGAGAAAGAT GATGGCCAGC AATATGCTTG GGCAACGTAC
TTGGTGCCTA AAACTTGTGA AAATCAGGTA ATTGAGTTAC TGCAAACAAT TCAAGCTAAG
ATGCTTCATG ATGATACAGA TATAGAAGCT CGTCTAAGTA CAGAGCAAAA TGCTTGGGTG
GCTGTCAATG CTGAACGTTA TTACCGAGCT ATGATCAAGC CAGGGCCAGA CTCTTGGAAT
ATCCGTGACT ACCATATGAT GGAAACCCTG AACCGTTTAC TACAATTTCA TGGCAAAGAG
GCAAAAGCCA TTGTTTGGGA ACACAATACG CATATTGGAG ATGCAAGATT TACGGATATG
CAAGATGATG GCATGATTAA TATTGGCCAG CTAGCACGGG AACAATATGG AGATCAAGCA
GTGAAATTGG TAGGGCTTGG TAGTTATCAA GGAACTGTAA TGGCTGGCAG GTCTTGGGGA
GCAAAACAGG AAATTATGCA GGTGCCAACA GCAAGAGAGG GGAGTTGGGA AAAGCTCCTT
CATGATATAT CCCCTGATAA CTTCTATTTA TTAATGGATG AGATAAGAGA CAGTTTTGGC
TCTGGCACTG TTTACGACCA TAGGGCTATA GGTGTTGTCT ATCATCCAGA GCGTGAGCAT
TATGGAAATT ATGTTCCTAG TCAGATAGCT AATAGGTATG ATGCTTTTGT ATACTATGAT
GAAACACAAG CGTTGCACCC CATCAAACAT GAAAAGAAGC TTACGAAGAT GCCAGAAACA
TATCCATGGA ACTTTTAG
 
Protein sequence
MLLIKNIQIM QKDWNLMDTI QSNKHSLQDE KDLDALIKHI CDAKIVMLGE ASHGTHEYYA 
GRALISRRLI EEKGFNFITV EGDWPPCYQI NRFIKNYVGR QKDIQEVLQV FDRWPTWMWA
NWEIAAFSWW LHQYNLKHVP LNRIGFYGLD VYSLWESLQA IVNYLEKEDP EAASVAKQAI
RCFEPYWEKD DGQQYAWATY LVPKTCENQV IELLQTIQAK MLHDDTDIEA RLSTEQNAWV
AVNAERYYRA MIKPGPDSWN IRDYHMMETL NRLLQFHGKE AKAIVWEHNT HIGDARFTDM
QDDGMINIGQ LAREQYGDQA VKLVGLGSYQ GTVMAGRSWG AKQEIMQVPT AREGSWEKLL
HDISPDNFYL LMDEIRDSFG SGTVYDHRAI GVVYHPEREH YGNYVPSQIA NRYDAFVYYD
ETQALHPIKH EKKLTKMPET YPWNF