Gene Aasi_1023 details

Gene Information

Locus tag: Aasi_1023
Symbol:
ID: 6376868
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 1329239
End bp: 1330873
Gene Length: 1635 bp
Protein Length: 544 aa
Translation table: 11
GC content: 34%
IMG OID: 642682139
Product: hypothetical protein
Protein accession: YP_001958100
Protein GI: 189502383
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
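
The length fields in this record agree with the coordinates, which a little arithmetic can confirm. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive, 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1329239, 1330873)
print(length)                  # 1635
print(protein_length(length))  # 544
```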


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGGAAAT ATAAAAGGAT ATTACCTGAC AAAAATTTTA TATTAGGTAT GTGTGCAGGG 
ATAGGGCTGT GCTTATATAT AGGCTTATAT TTTTTTCAAA ATTCTTTCAA ACAGCCTGTT
CGTAAATTCG GCGAAGTATT AAACTATATT CAAAAATACC ATATAGATAC CATAGATAAT
GCTAAACTGG TAGAACTTAC AGAGGCAGCG CTCTCAAAGT TAGCTAAGCA GTTAGATCCA
CATACCACTT ATATTGATGC ACAACAGAAT GCAGTAAGTA GAAATCATTT AAAGAGCCAA
ATTGAAGGGA TAGGTATTGA GTTTGTTTTA TTAAAAGATG TAGTGTATGT ATTACATGTT
ATTCCTAAAG GGCCTGCAGA CCAAGCAGGC TTACAAGTAG GGGATAAGGT TGTTAAAATA
GATGGGCATA TTTTAAAAGA AGCAAATTTT AATTCAAATG ATATAGTATT AAAAATGAGA
GGCCCTAAAG GAACTCCAGT AAAAGTCTAT ATATGCCGGA ACAACACAAA AGATCTAATT
GAAATTACCA TCATAAGAGA TCAAATTTCT ATACCGTCCA TCGATGCAGG CTACATGGTA
GATAGCCAAA CAGGCTACAT TAAGTTAAGT CAATTTGCAA GCAAAACTTA CCAAGAATTT
ATAGAAAGGA CAAATCAGCT ATCAGAACAG GGAATGAAGA AGTTACTGCT TGACTTACGA
GATAATTCAG GAGGTTATTT CGAGACAGCC TTAAATATGG CTGAAGAAAT GCTAGAACCA
GGAAAGTTGA TAGTATATAC AAAAGGTAAA TACAAAGGCT TTGATACAAA ATACTATGCA
AATGGGAAAA ATAGGCTTGG TAAGCTACCC ATCATTATTT TGATCAATGA GAATACTGCT
TCTGCTTCAG AACTGTTAGC AGGTGCTTTA CAAGACCATG ATAGAGCACT TATTGTAGGT
AGAAGGTCTT TTGGTAAAGG GCTAGTACAA TGGCCTATTG AGTTTAAAGA TAGCTCTGTA
TTGAGTTTGA CTGTAGAAAG CTACTTTACA CCAAGCGGAA GGTCTGTACA AAAGCCCTAT
GATAAAAGAA TAAACTATGA ATTAGACTTA TATAATAGAT ATAAGCAAGG CGAGTATTTT
CATGCAGATA GCATACAGCT TGACAAAACT ATAGCATATC AAACTTCAGC AGGAAGAACA
GTGTATGGAG GCGGGGGAAT TATGCCTGAT CATTTTATAC CGATAGATAC TACGGCGCAT
AGTGACTATG TTAACGAGCT AGTAGATAAC TACATTATAC AACAGTATGC TATAACATAT
GCACGCTCTA ATAAACAGAA ACTGGAAAAG TTGAGATTAG AGGATTATCT TAAAATTTTT
TGCGTAACTG AAGAAATGGT TGGTCAACTT GTTGAGGAAG CTAAAAAGGC AGCAATCAAG
CAAGTATTTA TAACCGATCC AATAAAAATC TCTATTAAAA ATTTGCTTAA AGCATATATT
GCCAAAACAT TATGGCAATA TCAAGGATTT TATAGTGTAT ACAATAAAAC AGATACAACT
ATTCTAAAAT CCTTACAACT ATTTAACCAA GCAGAAGCAT TACTGCAAGA AGATATAACT
TACATAGCAG GCTAG
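
The GC content listed in the record (34%) can in principle be recomputed from the gene sequence above. This is a minimal sketch, assuming the sequence is pasted as plain text in the space-separated 10-base groups shown here. `gc_content` is a hypothetical helper name, and the short string in the usage line is a toy input, not this gene.

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Toy input: 8 bases, 6 of which are G or C.
print(gc_content("ATGC GGCC"))  # 0.75
```

Passing the full gene sequence to `gc_content` and multiplying by 100 gives a percentage to compare against the record's value.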
 
Protein sequence
MWKYKRILPD KNFILGMCAG IGLCLYIGLY FFQNSFKQPV RKFGEVLNYI QKYHIDTIDN 
AKLVELTEAA LSKLAKQLDP HTTYIDAQQN AVSRNHLKSQ IEGIGIEFVL LKDVVYVLHV
IPKGPADQAG LQVGDKVVKI DGHILKEANF NSNDIVLKMR GPKGTPVKVY ICRNNTKDLI
EITIIRDQIS IPSIDAGYMV DSQTGYIKLS QFASKTYQEF IERTNQLSEQ GMKKLLLDLR
DNSGGYFETA LNMAEEMLEP GKLIVYTKGK YKGFDTKYYA NGKNRLGKLP IIILINENTA
SASELLAGAL QDHDRALIVG RRSFGKGLVQ WPIEFKDSSV LSLTVESYFT PSGRSVQKPY
DKRINYELDL YNRYKQGEYF HADSIQLDKT IAYQTSAGRT VYGGGGIMPD HFIPIDTTAH
SDYVNELVDN YIIQQYAITY ARSNKQKLEK LRLEDYLKIF CVTEEMVGQL VEEAKKAAIK
QVFITDPIKI SIKNLLKAYI AKTLWQYQGF YSVYNKTDTT ILKSLQLFNQ AEALLQEDIT
YIAG
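
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes translation table 11, as stated in the record. That table assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted, so the standard codon table is used here. `translate` is an illustrative helper, not a library function.

```python
from itertools import product

# Codon table in NCBI order: first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("ATGTGGAAAT ATAAAAGGAT ATTACCTGAC"))  # MWKYKRILPD
```

The output matches the first ten residues of the protein sequence above. Running the full gene sequence through `translate` would allow the same comparison over all 544 residues.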