Gene Acid345_4249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4249 
Symbol 
ID4073176 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5042322 
End bp5043380 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content55% 
IMG OID637986281 
Productpeptidase M12A, astacin 
Protein accessionYP_593323 
Protein GI94971275 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.150684 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.841288 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTACCC GATCACTCTG CGGCTTGATG TTAGTAGTCG TGGTTGCATC CACGTTGTTT 
GCCCAAGAAA TCGCCGGCCG GCTTCCCACT GTCGAGTCAC TGCCGGACCG CGTGCAGGCA
CGCATTGAGG TCAGAACGAG AGCGGCTGGA AATGCGCACA ACGCCGGTCG CTTTGGCATG
CAGTACTTCA TGTTCATTAC AAAACGTTGG CCGAATGCAG CGAGCACGCC GATTACCGTG
GCGTTCCTTG GCGGCGATCG TCAACTGCGG CAGCGCATCC AAGACACCGT TACGGAATGG
AGTCAGGCTG GCACGCTGAG GTTTGATTTC ATCGATCCGG CGTCCCACAC CTTCCGCGAG
TGGTCGCGCT CTGACACCAG TTTTAAAGCA AACATTCGGG TAGCATTCGA CGGCTCTGAA
GAAGCGGGCT ACTGGTCTAT GATCGGCGTC GACAGCTCCG ACCCAACGAT CATCAAGCCG
GGTGAGGCAT CACTGATGTT GCAAGGATTT ACGACACTGC TGCCGCAGGA CTGGCAGGCG
ACGGTCCGTC ACGAATTTGG CCACGCGCTC GGGCTTCTCC ATGAACACCA AATCCCAGTG
GGCGGGTGCG ATCAAGATTT TCGATGGGAG GACGATACCG GCTACGTGCC CACTCAGGAC
TCCTACGGAC AGTACATTAC CGATGCCCAA GGTCGGCGAC CCGGCATCTA CACCTTACTC
GCAGGCGCAC CGAACTTTTG GCAAAAGGAC AAGGTCAATA GCAACATGAG GCAGCTCGCA
ACTGACTCCC ACAATAAGGA CTTCGGGGCT TTCGATGCAA AGTCAATCAT GAAGTATTAC
TTCGACCCGT CGTTCTTTCG CGACGGAACA GCTGCCCATT GTTACAGCGA CGAAAATTTG
ACGATCTCAG ACGAGGACAA GCAGGGCATA GCGAAATGGT ATCCACCATT CGGCTCTCAG
GAACTGAGCA ATCTACTCAA ATTGCAACAG GACACTATGC GACAACTTGC GCCTGTGCAC
AACATGCAGC AGGTTCAGAC GCTTCAGTCA ATCAAGTAG
 
Protein sequence
MFTRSLCGLM LVVVVASTLF AQEIAGRLPT VESLPDRVQA RIEVRTRAAG NAHNAGRFGM 
QYFMFITKRW PNAASTPITV AFLGGDRQLR QRIQDTVTEW SQAGTLRFDF IDPASHTFRE
WSRSDTSFKA NIRVAFDGSE EAGYWSMIGV DSSDPTIIKP GEASLMLQGF TTLLPQDWQA
TVRHEFGHAL GLLHEHQIPV GGCDQDFRWE DDTGYVPTQD SYGQYITDAQ GRRPGIYTLL
AGAPNFWQKD KVNSNMRQLA TDSHNKDFGA FDAKSIMKYY FDPSFFRDGT AAHCYSDENL
TISDEDKQGI AKWYPPFGSQ ELSNLLKLQQ DTMRQLAPVH NMQQVQTLQS IK