Gene GSU1083 details

Gene Information

Locus tag: GSU1083
Symbol:
ID: 2688626
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 1168841
End bp: 1169917
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 65%
IMG OID: 637125752
Product: hypothetical protein
Protein accession: NP_952136
Protein GI: 39996185
COG category: [S] Function unknown
COG ID: [COG4260] Putative virion core protein (lumpy skin disease virus)
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGAACGG AGGGCCCCGT GGCGATCAAC AGCATGAATA CGGTATTTCT TGAGGTAGTC 
GAGTGGTTTG ACGACAGCGG CCGGGAGATG GTGCGCCGCA TCCCCCCCGA GGGGTCCGCC
GAGATCAAAT TGGGCGCCCA GCTGGTGGTG CGGGAGAGTC AGCGGGCGGT CTTCTTCAGG
GACGGCAAGG CCGCCGACTG CTTTGGCCCC GGTAGGCATA CCCTGACCAG CGCCAACCTG
CCGATCCTGA CTAAGCTCCT GTCGCTCCCC TGGGGAGGCA CCTCTCCCTT CCGCTGCGAA
GTCTGTTTCG TGGGAATACA GACCTTCACC GACCTGCGCT GGGGAACCAA GGAACCCGTT
GCCTTCCGGG ACAGCCGCTT TGGCATGGTA CGGCTGCGGG CCTTCGGCAC CTACACCCTG
CGGGTGGTCG ACCCGCAGCT CCTGGTGAAC GCTCTGGTGG GAACCCGGGG GCTTTACACC
AGCAGCGAGC TGGAGGAACT GTTCCGCGAC ATCATCGTGG CGCGGCTCAA CGACTACCTG
GGCGAAACCA TCGACTCGGT TCTGGATCTG CCCGCCCGCT ACGATGAGAC TTCTGCTGCG
CTCAAGGAAC GCCTGGCAGG AGACTTCGGA GGATTCGGCA TCGAACTGGC CGAACTTTAC
GTGAACGCGA TCACTCCCCC GCCGGAGGTC CAGAAGGCAA TCGACGAGCG CACCTCCATG
GAGGCGGCCG GCGATGTGGA TCGCTACCTG AAGTTCAAGG CGGCCCGAAG CCTGGAAGCG
GCGGCCTCGG CCGAGGGGGG CGGCGAGGCT GCCCAGGGGA TGGGGATCGG GGTCGGTGCC
GGTCTCGGCA TGATTCTCCC CGGTATGGTG GCGAATGCCA TGGCACAGGG AGCTGATGCG
TCCGCTCCTG TCGGCACGGG TAGCTGTCCC CGCTGTATGG CGCCGTTGGT GGCGGGTGGA
AATTTTTGCC ATCAGTGCGG CGCCCCGGTA GAGTCCGGCT TTTGCTCCGG CTGCGGCAAG
CCGCTGCCGA CAGAAGCTCG CTTCTGCCCC GGCTGCGGAC GGCAGGCCGG AGCATAG
 
Protein sequence
MRTEGPVAIN SMNTVFLEVV EWFDDSGREM VRRIPPEGSA EIKLGAQLVV RESQRAVFFR 
DGKAADCFGP GRHTLTSANL PILTKLLSLP WGGTSPFRCE VCFVGIQTFT DLRWGTKEPV
AFRDSRFGMV RLRAFGTYTL RVVDPQLLVN ALVGTRGLYT SSELEELFRD IIVARLNDYL
GETIDSVLDL PARYDETSAA LKERLAGDFG GFGIELAELY VNAITPPPEV QKAIDERTSM
EAAGDVDRYL KFKAARSLEA AASAEGGGEA AQGMGIGVGA GLGMILPGMV ANAMAQGADA
SAPVGTGSCP RCMAPLVAGG NFCHQCGAPV ESGFCSGCGK PLPTEARFCP GCGRQAGA
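
Consistency check (illustrative sketch)

The numeric fields in this record can be cross-checked against each other and against the sequence. The Python sketch below is a minimal illustration, not part of the original record. It assumes that Start bp and End bp are 1-based, inclusive coordinates, and that the CDS ends in a stop codon that is not counted in the protein length. The function names (clean_sequence, gene_length, protein_length, gc_percent) are hypothetical helpers introduced here for illustration only.

```python
def clean_sequence(text: str) -> str:
    """Strip the spaces and line breaks used for the 10-letter block layout."""
    return "".join(text.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Coordinates taken from the record above.
length_bp = gene_length(1168841, 1169917)
print(length_bp)                  # compare with the Gene Length field
print(protein_length(length_bp))  # compare with the Protein Length field

# GC content of the first 10-base block only, as a small demonstration.
print(gc_percent("ATGCGAACGG"))
```

To check the reported GC content, pass the full gene sequence (with or without the block spacing) to gc_percent and compare the rounded result with the GC content field.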