Gene GM21_3733 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3733 
Symbol 
ID8139107 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4300166 
End bp4301410 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content68% 
IMG OID644871352 
Productcompetence/damage-inducible protein CinA 
Protein accessionYP_003023510 
Protein GI253702321 
COG category[R] General function prediction only 
COG ID[COG1058] Predicted nucleotide-utilizing enzyme related to molybdopterin-biosynthesis enzyme MoeA 
TIGRFAM ID[TIGR00177] molybdenum cofactor synthesis domain
[TIGR00199] competence/damage-inducible protein CinA C-terminal domain
[TIGR00200] competence/damage-inducible protein CinA N-terminal domain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones122 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGGTGT CGGTTCTCTC CATAGGCGAC GAGCTCCTTT GCGGCGAGGT TGTGGACACC 
AACGCAAGCC ACATCGCCGG CCGGCTCTTT CAGGCGGGGG GGCGGGTGGA GCGGCACCTG
ACCGTCCCCG ACGACGCGGA GGCGATCGTC CGCGCCCTCA CGGAGCTCGG CGCACGCAGC
GAAGCGGTTA TCGTCACCGG GGGCTTGGGC CCCACTCCGG ACGATCTCAC CGCCGAGGCC
GCGGCGCGGG CAGCCGGAAC GGAACTGGAG CTCTCAACGG AAGCGCTGAC CCACCTGGAG
CGTTTCGCGC AAAGGATCAC CGGAGAGCTG CACCCGGCCA ACCGCAGGCA GGCGCTTCTC
CCCAGTGGGT GCAGGCTGAT CCCCAACCCT TTGGGGACCG CCTTGGGCTT CGTGGTCCGC
ATAGGCTGCG CCGACTGCTT CTTCATGCCC GGCGTCCCTT TCGAGATGGA GCGGATGCTG
GAGGAGACGG TGCTCCCGGA GCTGCGGAAC AGGTTTCCGG CCGGCTGGCA GCGGGTGACA
CTGAAGCTCT TCGGCATCGC GGAGGCTGCC ATCGCGGAGC TTTTGGAGGG GGCGATTCCC
GAAGGGTCCC GGGTGCAGCT TGCCTACTGC GTGAAGTTCC CGGAGATCCA CCTGATCCTG
CGGGCCAGCG CCACCGACGC GCCAGCCTTG CAGCAGGCGG CCGGCGAGCT GCGGCGGCGT
CTTGGCGCCT ATCTCTTCGC CGAGGACCGG GAGGAGATGG ACGACCGGCT GGCGCTTTTG
CTGCGGGAAA GCGGCCTCAC CCTGGCGCTC GCCGAATCCT GCACCGGCGG CATGATCGCC
GCCCGCATCA CCGCCGTCGC CGGAAGCTCC GCCTATTTCC TTGAGGGAAA CGTCACCTAC
AGCAACGAGG CGAAGACCAG GATGCTGCAG GTCCCACCCC CCCTGATAGC CGAGCACGGC
GCGGTCAGCG CCGAGGTCGC CCGCGCCATG GCGGTCGGGG CCAGGGAGGC GGCGGGAAGC
GACCTGGCTT TGTCGGTGAC CGGCATCGCC GGCCCGGACG GGGGGACCCT AGAGAAGCCG
GTCGGCACCG TCTACCTGGC CCTTGCCGAC CAGGGCTCTT GCCGGGTCGA GCGCTTCAAC
TTCCAAGGCG ACCGCGACCG CGTCCGTTGC ATCACATGCT TCACCGCGCT CAATTGGCTG
CAAAGCTACC TCCTCACGCG TAAGACGACA CCAGGCCGGG GTTGA
 
Protein sequence
MRVSVLSIGD ELLCGEVVDT NASHIAGRLF QAGGRVERHL TVPDDAEAIV RALTELGARS 
EAVIVTGGLG PTPDDLTAEA AARAAGTELE LSTEALTHLE RFAQRITGEL HPANRRQALL
PSGCRLIPNP LGTALGFVVR IGCADCFFMP GVPFEMERML EETVLPELRN RFPAGWQRVT
LKLFGIAEAA IAELLEGAIP EGSRVQLAYC VKFPEIHLIL RASATDAPAL QQAAGELRRR
LGAYLFAEDR EEMDDRLALL LRESGLTLAL AESCTGGMIA ARITAVAGSS AYFLEGNVTY
SNEAKTRMLQ VPPPLIAEHG AVSAEVARAM AVGAREAAGS DLALSVTGIA GPDGGTLEKP
VGTVYLALAD QGSCRVERFN FQGDRDRVRC ITCFTALNWL QSYLLTRKTT PGRG