Gene Acid345_2621 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2621 
Symbol 
ID4072030 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3092186 
End bp3093175 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content61% 
IMG OID637984638 
Productpeptidase U61, LD-carboxypeptidase A 
Protein accessionYP_591696 
Protein GI94969648 
COG category[V] Defense mechanisms 
COG ID[COG1619] Uncharacterized proteins, homologs of microcin C7 resistance protein MccF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.799378 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.878166 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGTTTC GCATGCCGCC ACGCGCCACA ATCCGGCCGC CGGCGCTACT GCCCGGCGAC 
ACCGTGGGCA TCGTGGCGCC CGCCAGCAAT ATTGACCGCC CCGCGTTGCT CGCCGGATGC
GCCGCGCTGG AGCGCATGGG CTACAAGCCG TATTTTCTCG AATCGATTGT CGAACGTGAC
TTGTACTTTG CCGGTTCCAT CGAGCGGCGA GTCGCTGAAC TGGAGCACAT GTTCGCGAAT
CCCGAGGTGC GGGCCATCGT CTGCGCGCGG GGCGGATATG GCGTGAACTA CTTGCTGCCG
AAGCTCCCAA TCGAGAAGCT GCTGGCGAAT CCCAAGCCAT TCGTCGGCTA CAGCGATATC
ACCGTGCTGC TCACGTGGCT CACGGATCAC GGGCTGGTGA CGTTTCACGG GCCGATGGTG
ACGAAGGACT TCGGCCGCGA GTACGGAATT GATCTCGAGA CCTGGATGGC GGTGCTCGGC
AATGCCTCGG ATTATGAGCA TACGTTCTCG ATTGATGAAG TACAGCCACT GGTGAAGGGC
AGTGCGGAAG GTGTGCTTTA CGGCGGATGC CTCTCGCTGC TCGCAGCCTC GATGGGCACG
CCCTACGAAT TCAAAACCGA AGACACGATC CTGTTTCTCG AAGACGTGAA CGAGAAGCCG
TTCCAGATCG ATCGCATGCT GCGGCAGTTG CTGCTGGCAG GCAAGTTCAA GACGGTCCGC
GCCTTCGTCT TCGGCGAAAT GCTCGATTGC CAGCAGCCCC ACGGACAGGA TTACACCTTG
CAGGAAGTCA TTCTGCGCAT TCTCGCACCG CTCGGCGTTC CGGTCGCGTT TGGGCTTTCG
TCCGGACACG TACGTGCCGC GAACCGCGTG TTGCCCTTCG GAGTGTGCGC CAAACTTGAA
GTTGGCGAAC CGGTGCGGCT CAGATGTGAA GCGGCAGTCG CGCAGGGAAC CGGCGCGCCG
CGCATACTCA AGGAACCATC CAAGCAATGA
 
Protein sequence
MAFRMPPRAT IRPPALLPGD TVGIVAPASN IDRPALLAGC AALERMGYKP YFLESIVERD 
LYFAGSIERR VAELEHMFAN PEVRAIVCAR GGYGVNYLLP KLPIEKLLAN PKPFVGYSDI
TVLLTWLTDH GLVTFHGPMV TKDFGREYGI DLETWMAVLG NASDYEHTFS IDEVQPLVKG
SAEGVLYGGC LSLLAASMGT PYEFKTEDTI LFLEDVNEKP FQIDRMLRQL LLAGKFKTVR
AFVFGEMLDC QQPHGQDYTL QEVILRILAP LGVPVAFGLS SGHVRAANRV LPFGVCAKLE
VGEPVRLRCE AAVAQGTGAP RILKEPSKQ