Gene GSU2119 details

Gene Information

Locus tag: GSU2119
Symbol:
ID: 2687742
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 2337618
End bp: 2338979
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 48%
IMG OID: 637126811
Product: integrative genetic element Gsu56, integrase
Protein accession: NP_953168
Protein GI: 39997217
COG category:
COG ID:
TIGRFAM ID:
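
The coordinate and length fields above can be cross-checked with a few lines of arithmetic. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2337618
end_bp = 2338979

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the result agrees with the listed gene length of 1362 bp and protein length of 453 aa.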


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACAAT TTACCGACCG TTACCTCACC TCTCTCAAAC CTCAAGATAA AAAATATGTC 
GTAAGAGAGG GCCGAGGATT CGCGATCCAG GTGCTGCCTT CAGGAACAAA GACATTTATG
TATATCTTTG AACTGAACAA GCAGAAGGGG TACCTACTGC TCGGCAATTA TCCCGCCATG
TCCTTGGGTG ATGCCCGGAT AGCTTACAAC GACGCATATA AACTCGTCAA GAACGGCATC
GATCCACGCG AGGAAAAAAG AACAGCTATC GAGGAACAAT CTCGATTGGC AAATGAGGCT
AGACTTCAAG CTGAAGCTGC AGCCCTTGCT GCTGAGAAAT TGGAAAAAGA TTCCTTTGAT
TCCCTGATAG AGGATGAACT CCCCGAAGGA TACACCCCGA TAACCGTAGA ACAGCTCGCT
GCGATATGGT ACGTCAAATA CTCTAAGGAG AATCATTCAG TTCGATGGCG AGAAACTATC
CTTAGCGCTA TCAAGACCCA CATCATTCCC GGTATCGGCA AAATGGAAAT TTCCTCTGTC
AGACACAAGC ATGCTGTTTC TCTCATCGAG CAAATTGCAT CCAAGGTCCC GGGATCGGCT
CGTAACGTGA TGAAATTTGG CAGACAAATG TTCAAATATG CCTGTCGGCA AGAGTGGGCG
GAGATTCAGC CGTTCCAGGA GATCACAGCA TCTGTCCCCA AGATTGCCCC CAAAACTGAC
GACCGGCATC TTGATGACGA CGAAATCGTG AAGGCGTGGA AAGAAATCAG CAAGGGACCA
AGCTCTACCG AGGTCAAGCG TGCGCTTAAA TTGATTCTGG TAACCGCTCA GCGCCCCGGA
GAAGTTGCAC AAATTCACCG TGATCAGATC AAGGACAGAT GGTGGACTAT CCCTGCAGAG
GTTGCTGGCA AAAATGAACG TGAGCACAGA GTCTACTTGA CTGACACTGC TCTGGAGCTG
ATCGGACAAG GTAAAGGGTA CATCTTCTCA TCTGGCCGAG GGAAAAGAGG CCATATTTCC
GAGAACACTC TTTCACAAGC CATAAATCGA GGTTATTTGG ACGAAGATGT TGTGAAAGTT
GTTGGGAACA GAAAAATCAA AGCGCGCAAA GAACCTTACT TCGGGATGAA GCCATGGTCG
CCGCATGATC TTCGCCGAAC CGCACGCACA AATATGGCAC GAGTTGGCAT TACAGACGAA
GTTGGCGAAG AAGTCATAAA TCACATCAAG CCAGGCATAG TCGGCGTTTA CAATAAATAT
CGTTATGACA ATGAGAAAAA GGACGCCCTT TTGAAGTGGG AAGCCTTGTT ACTGAACATT
CTGTCACCCA AACCGCAGGA TAGTAATGCA GATGGAGAAT AG
 
Protein sequence
MKQFTDRYLT SLKPQDKKYV VREGRGFAIQ VLPSGTKTFM YIFELNKQKG YLLLGNYPAM 
SLGDARIAYN DAYKLVKNGI DPREEKRTAI EEQSRLANEA RLQAEAAALA AEKLEKDSFD
SLIEDELPEG YTPITVEQLA AIWYVKYSKE NHSVRWRETI LSAIKTHIIP GIGKMEISSV
RHKHAVSLIE QIASKVPGSA RNVMKFGRQM FKYACRQEWA EIQPFQEITA SVPKIAPKTD
DRHLDDDEIV KAWKEISKGP SSTEVKRALK LILVTAQRPG EVAQIHRDQI KDRWWTIPAE
VAGKNEREHR VYLTDTALEL IGQGKGYIFS SGRGKRGHIS ENTLSQAINR GYLDEDVVKV
VGNRKIKARK EPYFGMKPWS PHDLRRTART NMARVGITDE VGEEVINHIK PGIVGVYNKY
RYDNEKKDAL LKWEALLLNI LSPKPQDSNA DGE
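
The relationship between the gene sequence, the protein sequence, and the GC content field can be explored with a short script. The following is a sketch only: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, the codon table is written out by hand for translation table 11, and alternative start codons are not handled (this gene begins with ATG, so that does not matter here). The blocks of ten bases in the listing are joined by stripping whitespace.

```python
# Codon order follows the usual NCBI layout: bases T, C, A, G at each position.
BASES = "TCAG"
# Amino acids for translation table 11 in that order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; stop codons appear as '*'."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence listed above.
print(translate("ATGAAACAAT TTACCGACCG TTACCTCACC"))  # MKQFTDRYLT
```

Passing the full gene sequence to `translate` should give the listed protein followed by a single `*`, and `gc_fraction` applied to the same input can be compared against the GC content field.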