Gene GSU1657 details

Gene Information

Locus tag: GSU1657
Symbol:
ID: 2687094
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 1817400
End bp: 1819715
Gene Length: 2316 bp
Protein Length: 771 aa
Translation table: 11
GC content: 59%
IMG OID: 637126338
Product: DNA internalization-related competence protein ComEC/Rec2
Protein accession: NP_952708
Protein GI: 39996757
COG category: [R] General function prediction only
COG ID: [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily)
TIGRFAM ID: [TIGR00360] ComEC/Rec2-related protein
            [TIGR00361] DNA internalization-related competence protein ComEC/Rec2
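The two length fields can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the last codon of the CDS is a stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1817400, 1819715)
print(length)                  # 2316
print(protein_length(length))  # 771
```

Both results agree with the Gene Length and Protein Length values listed in the record.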


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTCCCTC CCTGGCTTGC CTGGCTTTGC CTTGCGGGTT CAACGGCGCT CATTTTCTTC 
AGGAGCGCAG TCCCTCTTAT AGTTGCTGTC GGGCTCACCT TTTTCGTCTG GGGAGCCTCG
TCCCTTGCCC CTTTTCTCTA TCCGGATCTG TCGTCTACCC ATGTGGCCAG ATACACCGGC
TACGGACCGG TAATCATCGA AGGTATTGTC GATTCACGGC CCGACGTCCG CCCCAGTGAA
GGGAAGTTTT GCCTGGCCGC CGAATCTGTC ATAGCCGCCG GTTCGACACT GCCCGCAAGC
GGACGGCTCA TAGTCTATGT GAAGGCAGGA ATGGTACCGT TTCTCACTGG TGATCGGGTG
CGTTTTCGGG CCAAGGTTGC AGAGCCCTCC AATTATGGAA TTCCGGGCGA ATATGACTAT
CGGCGCTCCC TGGCGTATCG GGACATCCAC GCCACGGCGT TTGTAAGGCA TGCCGACGAT
ATCCTGTTGA TGCGACAAGC AGTGGCCTTT CCGGTGCAAC GTTTTTTCGA TCGATCGGCA
GCCAAGCTCG GCGATGTCAT CGGTCGGTAT TCTCCCAACG AAGGAGGCGT TCTCCGGGCG
CTCCTGCTGG GGGAGCGGGG ATTCGTCTCT AAAAAGCTGG AAGAGGCCTA CGCACGTGCC
GGTGTAAATC ATATCCTCTC AATCTCCGGT TTCCATGTGG GAATTATTGC TCTGGTCATC
TACCGGGCTC TGCTTCTACT GCTCAGTCGC TTCGAGCGCC TTGCCCTGCA ATTCAACCTG
CGGCGGACAG CACTGCTGGC GACACTTCCC CCTGTTACAT TCTACCTGTT TCTTTCCGGT
GCCGCCCCGG CCACGGTCCG TTCCGTGCTG ATGATAGCCG TAATCACCCT CGCGCTCTGG
CTGGAGCGTG AAACAGACCC GATCAACGTA TTGACCATGG CGGCCCTCGC AATGCTCGCT
GCCAATCCCC CGTCGCTGTT CGATATTTCC TTCCAGTTGT CGTTTCTCGC CCTTTGGGGA
CTTGTGGTCC TGACGCCGGT TTTCACTCAT CCGTTGTGCA CTCTGGATAA TGGAGTGGTC
AAGAACGTCA TCCTGCTGCT GGCCGCTTCG ACTGCAGCGA CCCTCGTCAC GTTTCTTCCC
GTGGGGCATG CCTTTCATCG GGCAACTGTG GCAGGGATTA TCAGCAACGT TTTCATCGTA
CCGTTGATGG GGTACGGAGC GGTGGTGGCC GGATTCGCCG CGTTGCCGCT CATTGCGGTT
GCCCCGGTTG CGGCCGGGCC GCTCATTACT ATTGCTTCCT GGCTCGTTGC ATTGTCCAAC
CGGATTATCG AGTGGCTGGC ACGGATTCCC CCGGTCCCCC TTCACAGCGT CACCAGGAGT
GATCTGCTCG TGTCGTTTCT GGTGCTGCTG GGATTAACTT TGCTGCCCGG TCGCCGTGCG
CGGATCCTGG TCGCAGGAAT TGGCGCCTTA GCACTTTCCG CTCTTCACTT GCCTGCCATG
GCGGGAGGCG ATGCCAAGCT CGTAATCACC TTTCTGAGTG TAGGGCAGGG GGAGTCAACG
CTCGTTTCTC TTCCCGACGG CAAGATCATG CTGGTGGACG GGGGAGGAGC CGTTCATGGA
GGCGGAACCG ATGTGGGGGA ACGACTTCTT GTGCCTGCTC TCTGGAGTAT GGGGGTGGAA
TCTATCGATT ATCTTGTATT GACCCACCCT CATCCCGACC ACCTGGAAGG CCTTCTCTAT
CTGGCAACGG CTATGCCCGT TGGTGAGTTC TGGGAAACGG GGCAGTCCAC GGAAAACACG
TCACTGGCAG AGTTGAGGGC CCGATTGGTC GCGCAGGGAG TGCCCATCAA GCACCTGTCC
GCCGCAACAG CGCCGTTCAC TGTCGGCGGT GCTCGGGTGG AACCGCTCTG GCCATGCGAC
AAGAGCCCGG GGAAGGGGGC TGATAATGAT GATTCACTGG TTTTCAGACT TGCCCTTGGC
GGCACGTCCA TCCTGTTCAC CGGCGATATC GGTGCTCCGG CCGAGGATGT CCTTGCGAAC
GATCCGGCGC GTCTGCGTTG CACGGTGCTG AAGGTGCCGC ACCATGGCAG CCGCTACTCA
TCATCCCCTT GCTTTCTCGA CGCAGCATCG CCTCAAGTGG CCCTGATTTC CGCCGGGCGC
AGAAACAACT TCGGGCTTCC CGCACCTGAA ACGCTCACCC GCCTGGTAGC CCGCGGGATA
GATGTTTATC GTACTGACCG GGATGGTACG GTGCAGGTTA CCTTCACCGG GGAGACGTGG
GCGGTGGGTA CCTTCGCGAA AGGGCATTTT CGTTGA
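The sequence is printed in blocks of 10 bases, so the whitespace has to be removed before any calculation. As a rough sketch of how a GC content figure such as the one in the record could be recomputed from this listing (the helper names are hypothetical, and the record does not say how its rounding was done):

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Fraction of G and C bases in a sequence (0.0 for an empty sequence)."""
    seq = clean_sequence(text)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# Last line of the gene sequence above, used only as a short demo input.
tail = "GCGGTGGGTA CCTTCGCGAA AGGGCATTTT CGTTGA"
print(len(clean_sequence(tail)))  # 36
print(f"{gc_content(tail):.0%}")
```

Passing the full gene sequence instead of the last line gives the value for the whole gene.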
 
Protein sequence
MFPPWLAWLC LAGSTALIFF RSAVPLIVAV GLTFFVWGAS SLAPFLYPDL SSTHVARYTG 
YGPVIIEGIV DSRPDVRPSE GKFCLAAESV IAAGSTLPAS GRLIVYVKAG MVPFLTGDRV
RFRAKVAEPS NYGIPGEYDY RRSLAYRDIH ATAFVRHADD ILLMRQAVAF PVQRFFDRSA
AKLGDVIGRY SPNEGGVLRA LLLGERGFVS KKLEEAYARA GVNHILSISG FHVGIIALVI
YRALLLLLSR FERLALQFNL RRTALLATLP PVTFYLFLSG AAPATVRSVL MIAVITLALW
LERETDPINV LTMAALAMLA ANPPSLFDIS FQLSFLALWG LVVLTPVFTH PLCTLDNGVV
KNVILLLAAS TAATLVTFLP VGHAFHRATV AGIISNVFIV PLMGYGAVVA GFAALPLIAV
APVAAGPLIT IASWLVALSN RIIEWLARIP PVPLHSVTRS DLLVSFLVLL GLTLLPGRRA
RILVAGIGAL ALSALHLPAM AGGDAKLVIT FLSVGQGEST LVSLPDGKIM LVDGGGAVHG
GGTDVGERLL VPALWSMGVE SIDYLVLTHP HPDHLEGLLY LATAMPVGEF WETGQSTENT
SLAELRARLV AQGVPIKHLS AATAPFTVGG ARVEPLWPCD KSPGKGADND DSLVFRLALG
GTSILFTGDI GAPAEDVLAN DPARLRCTVL KVPHHGSRYS SSPCFLDAAS PQVALISAGR
RNNFGLPAPE TLTRLVARGI DVYRTDRDGT VQVTFTGETW AVGTFAKGHF R
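
The protein sequence corresponds to a frame-1 translation of the gene sequence. The following is a minimal sketch of that translation, assuming the codon assignments of NCBI translation table 11, which are the same as those of the standard code (the tables differ in their permitted start codons, which does not matter here because the gene starts with ATG). The function and variable names are illustrative.

```python
from itertools import product

# Codon assignments shared by the standard code and NCBI translation table 11,
# listed in the conventional TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last line of the gene sequence listed above.
print(translate("ATGTTCCCTC CCTGGCTTGC CTGGCTTTGC"))         # MFPPWLAWLC
print(translate("GCGGTGGGTA CCTTCGCGAA AGGGCATTTT CGTTGA"))  # AVGTFAKGHFR
```

The two outputs match the first ten and last eleven residues of the protein sequence, with the final TGA codon read as the stop.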