Gene Acid345_2140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2140 
Symbol 
ID4072382 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2558445 
End bp2559689 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content62% 
IMG OID637984155 
Productcompetence/damage-inducible protein cinA 
Protein accessionYP_591215 
Protein GI94969167 
COG category[R] General function prediction only 
COG ID[COG1058] Predicted nucleotide-utilizing enzyme related to molybdopterin-biosynthesis enzyme MoeA
[COG1546] Uncharacterized protein (competence- and mitomycin-induced) 
TIGRFAM ID[TIGR00199] competence/damage-inducible protein CinA C-terminal domain
[TIGR00200] competence/damage-inducible protein CinA N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCGCCG AGATCGTCGC TATTGGGTCT GAGCTCCTAA CCCCCTTTCG TCAGGACACC 
AACTCGCTTT ACCTCACCCA ACGCTTGAAC GAGATGGGGG TCGAGGTCGC GTTCAAAAAC
ATTGTCGGCG ACAGTCGCGC GAACCTGGCG AGCGTGGCAC GTACCGCGAT CGCCCGCTCG
CACATCGTCC TCTTCATGGG CGGCCTTGGC CCAACCGAAG ACGACCTCAC CCGTGAAGCC
GTCGCCGATG CGCTCGGTCT TCGCCTCAAG CGCAATCCCG ATCTCGTTGC CGAACTCTAC
AAGCGCTTCG CATCGCGCCG CGTGACCATG CCCGACAACA ACATGCGCCA GGCCGACGTG
ATTGCCGGAG CCGAAATAAT CCAGAACGAC AACGGTTCGG CGCCGGGACA GTTCATCGAA
GGCGAACAAG ACGGCCAGCC GCGATACATC TTCCTGCTGC CCGGGCCTCC ACACGAACTC
AAGGCGATGT GGAATGAGAA GTGCCACCAT ACGCTGCGCG ACCGTCTACC TCGCGCCTAC
ATCGCCACGC GCGAGCTGCG GATTTCCAGC CTTGGGGAAT CGACCGTTGA CGCCCGCGTC
GCCCCGATTT ACACCAAGTA CAAAAACGTC GACACCACGA TCCTCGCCAA ACCCGGTGAG
GTCAGCTTGC ACCTGAAGAG CCGCGCCGCC ACGATGGAAC AGGCGCAGGC CGCGGTCGAT
CAACTCGCGG CCGAACTCGA AGACGAACTC GATGACGCTG TGTTCTCTAC CAACGGCGAA
TCGCTCGAAC AGATCGTCGG CTACTACCTG CAAATGCGCA GCGGAACGAT CTCCGTTGCC
GAGAGCTGCA CCGGCGGATT GCTCGCGGAA CGGCTGACGA ACGTCAGCGG CAGCTCGCGC
TATTTTATCG GCGGCGTGGT GGTCTATTCC AACCAGATGA AAACCCTGCT CGCCGACGTG
CCCCCGCTGA TGATCGAAGA GCACGGCGCG GTGAGCCGGC AAGTTGCCGT TGCGCTCGCC
GAAAACTTCC GCGAGATCAC CAACTCGACC ATCGGCGTCG GGATCACTGG TATCGCCGGA
CCGACCGGTG GCACCGAAGA CAAGCCAGTT GGCCTCGTGT ACATCGCCGT CGCCGACGAG
CTCGGAACCG ATGTCGTAGA ACGACGTTTC CCCGGCGATC GAGAACGCAT CCGCTGGTGG
TCGAGCCAAG TGGCGCTCGA CATGGTGCGC AAAAAACTGA TCTGA
 
Protein sequence
MIAEIVAIGS ELLTPFRQDT NSLYLTQRLN EMGVEVAFKN IVGDSRANLA SVARTAIARS 
HIVLFMGGLG PTEDDLTREA VADALGLRLK RNPDLVAELY KRFASRRVTM PDNNMRQADV
IAGAEIIQND NGSAPGQFIE GEQDGQPRYI FLLPGPPHEL KAMWNEKCHH TLRDRLPRAY
IATRELRISS LGESTVDARV APIYTKYKNV DTTILAKPGE VSLHLKSRAA TMEQAQAAVD
QLAAELEDEL DDAVFSTNGE SLEQIVGYYL QMRSGTISVA ESCTGGLLAE RLTNVSGSSR
YFIGGVVVYS NQMKTLLADV PPLMIEEHGA VSRQVAVALA ENFREITNST IGVGITGIAG
PTGGTEDKPV GLVYIAVADE LGTDVVERRF PGDRERIRWW SSQVALDMVR KKLI