Gene GSU2118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU2118 
Symbol 
ID2687762 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp2335994 
End bp2337334 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content57% 
IMG OID637126809 
Productintegrative genetic element Gsu21, integrase 
Protein accessionNP_953167 
Protein GI39997216 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAAAG TGGGTGAAAA GATCAAGCGT CGGACGTTCT GCTTCGCCAA GACCTTCTAC 
TGGCTGGACG AGAAGACGAA GGCCCACTTC ACGGCGCTGG AGGGAAGCGA CTACAAGCCG
GAACCGGAGC ACGTCTACTT CGGTGAGTAT TGCGAACAGT GGATGGAGCG GAAGATTCCG
ACCTTTTCGT CGGTGACGAA GCAAAGGGAT TACCGGGAAG CGCTCACCTC CCGCATCCTG
CCCTACTTCG GGGAGATGAC CTTCTCCCAG GTTACCGCCA CGGCGGTAGA GACGTTCATT
GACAATCTAA AGAGAGTAAA CCGTGCCAAA AATCCCAAGA AGACCAAGGG GGCAAAGCCC
CTGTCGGTGA AACGGGTCAA AAACATCATC GGTCCAATGT CAAAGGTCTG GGAATCGTCC
TGCAACGACT ACAACTGGAA TCTCCGCGAT CCGTTTTCCG CAGTAACCCA GAAGTACACG
GAGTTGACTG ACAGGGCGCT TCAGGAAAAA GAGCGGCAGG CCGCTCTGAG GAGTGATGAG
GAGGAAGATG TCTCGACGAG GGAGATCTTC CTGCTTGAAG AGTGGCAGAT ACTCTGTTCC
TACATCGATC CCCACTATTA CCCCGTGCTG GAACTGCTGA TGCTGGGGAT GATCGGCTCG
GAGTTGGAGG CACTGCAAAA GCGGCACATA AAGGGTGGCG TGCTGACAGT CCGCTGTGCG
GTAGCGAGGG ACCGGAAGGG GATGCGGCAC CTGAAGTTCA AGCCGAAGAA CTGGTATCGC
AAGCGGGACG TCCCCCTGAC CGGCAGAGTA CAAAGCCTTC TGGAACAGGC GATGGCTACG
GCGACGAGGG ACGGGGTTGT TACCTTCGCC AACGACATCG CCATCCCGGC CAACCAGTTC
GTCCTCACCA TGAAGGACGG CAGCCCCTTC AACTACAACT CATTCCGCAA GACGGTGTGG
AACAAGGCCT TGAAGGCGGC AGGCATGGAG CCTCGGGTTC CTTATGCGGC CCGGCACACT
CTGGTGCAGT GGTCGCTTCT GATCGGAATG ACCAAGACCC GGCTCGTGGA CCTGATGGGT
CATTCGACCA AGAAGATGAT CGACGAGGTG TACGGGAGCT ATCGGCAGGG ACTGGTGGAG
GAGAGGGAGC GGATTCTGGA TTACCTGGGG GAAGACTTCC TCGCCCTGGA AGAGATGAAG
CTTGCGTTCC CCGAGCGCTA CCGGCGGCGG ATGGCAACGA CGGAGCCGGC CCATGAAACG
GCGAAAGCCC CGGGCCTTCC CGCCACTTTT GGTCAAAGTT TTGGTCAAAG CCAGGGGCTC
TATCCGGATA ACTACCCGTA A
 
Protein sequence
MEKVGEKIKR RTFCFAKTFY WLDEKTKAHF TALEGSDYKP EPEHVYFGEY CEQWMERKIP 
TFSSVTKQRD YREALTSRIL PYFGEMTFSQ VTATAVETFI DNLKRVNRAK NPKKTKGAKP
LSVKRVKNII GPMSKVWESS CNDYNWNLRD PFSAVTQKYT ELTDRALQEK ERQAALRSDE
EEDVSTREIF LLEEWQILCS YIDPHYYPVL ELLMLGMIGS ELEALQKRHI KGGVLTVRCA
VARDRKGMRH LKFKPKNWYR KRDVPLTGRV QSLLEQAMAT ATRDGVVTFA NDIAIPANQF
VLTMKDGSPF NYNSFRKTVW NKALKAAGME PRVPYAARHT LVQWSLLIGM TKTRLVDLMG
HSTKKMIDEV YGSYRQGLVE ERERILDYLG EDFLALEEMK LAFPERYRRR MATTEPAHET
AKAPGLPATF GQSFGQSQGL YPDNYP