Gene GSU2201 details


Gene Information

Locus tag: GSU2201
Symbol:
ID: 2686687
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 2414494
End bp: 2415864
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 59%
IMG OID: 637126894
Product: cytochrome c family protein
Protein accession: NP_953250
Protein GI: 39997299
COG category:
COG ID:
TIGRFAM ID:
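
The coordinate and length fields in this record are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length (the listed gene sequence does end in TAA). The function names are hypothetical and chosen only for illustration.

```python
# Illustrative helpers (hypothetical names, not part of the source record).

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length_bp(2414494, 2415864)
print(length, protein_length_aa(length))  # prints "1371 456"
```

Under these assumptions the computed values agree with the Gene Length (1371 bp) and Protein Length (456 aa) fields listed in the record.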


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.826403
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAATA CGACGTTCAT CGCCACCCTC ATGGCGAGCG TGGCAGTGGC GGCCCTGGTA 
CAGGCCAAGG ATCATCCGGG CAAGGAATAT ATCCAGAAAA ACGGCTACCA GGGGCCGGCA
ACCTGCGAGG TCTGCCATCC CGGCGCGGCA AAGGAGTTCC TGAACTCCGT GCACTGGAAG
CACGCCTCGA AGGTCGACAA CGTTGAGAAC ATTGACCCGA AGCAGGAATA CGGCATGAAA
AACCGTATCT ACACCATGTG CAACGGGAAC GACATCGTCA ACAATCTGAA GGAGATTCCG
CCCAGCCCCG AGACGGGCAA GACCAAGTAC TCGGGCTGCA ACTCCTGCCA TCCCGGGAAT
CACATCCAGG ACGTGGGGAG CACCGGGCCC GAAGCGGAAG CCGCAGTGGA CTGCCTTGTC
TGCCACTCCT CCACCTATGA TCACAGCAAG CGCAAGCCCT TCAAAGACGA GAAGGGGAAC
GTGGTGCTCG GCCAGGATCG CAGCACCGAT GCGGCCCTTT CCATTGCCAC CCCGACGGTC
AAGAACTGTA TGACCTGCCA CGAGGCTGCA GGGGGCGGCG TGCTGGTGAA GCGCGGGTTC
GCCTTCAACA AGGAGCACGA TGTCCATGCG GCCAAGGGGA TGGTCTGCGT CGACTGCCAC
AAGACGAAGA ACCACAAGAT CCCCACGGGC TACGATCCGA ACAACTGGGC CCATGACGGC
GTACGTCTCT CCTGCACCGA CTGCCACACG GCAAAGCCCC ACAAGGACGA GGACTACAAC
CGCCATACGG CGCGCATCGC CTGTCAGACC TGCCACATCC CCCGGACCGG CGGCGCCTTT
GCCAAGGATT TCACGAAGTG GGAACAGCTC TCCAACAAGT TCTACGAGCC GACCACGCTC
AAAAAAGAAG CAAATGAAAC GGCTCCCGTC TACGCATGGT ACAACCTGAC CGTGGCCAAC
CGCCCTGACT TCATCGGGCC CAAGGGTGAC CGCAAGGACG GCAAGAGCAA GATCTACCCT
TTCAAGATCT TCCAGGGCAA GGCGTACTTT AACAAGAAGG ACGGCCAGCT CCTGTCCATG
GACTTCGCTC CGCCCATGGC AACGGGTGAC ACCCTCGCCG GCGTGGCATC AGCTGCCAAA
ATCCTTGGGA TCAAGGATTA CGAACCGGTC CCCGGCTGGC AGACCATCTA CTTCGGCAGC
AACCACCAGG TGGCTCCCAA GGAGAAGGCT CTTACCTGCT ATAACTGCCA TGCTCCCAAC
GGCATTCTGA ACTTCCGCGA GCTCGGCTAT TCGTCCGACG AGGTCAAAAA GCTCACGAGC
CCCGAACTCT ACTTTGAGAA AATCGCAGAG AAGATGCGGG AAGAGTGGTA A
 
Protein sequence
MKNTTFIATL MASVAVAALV QAKDHPGKEY IQKNGYQGPA TCEVCHPGAA KEFLNSVHWK 
HASKVDNVEN IDPKQEYGMK NRIYTMCNGN DIVNNLKEIP PSPETGKTKY SGCNSCHPGN
HIQDVGSTGP EAEAAVDCLV CHSSTYDHSK RKPFKDEKGN VVLGQDRSTD AALSIATPTV
KNCMTCHEAA GGGVLVKRGF AFNKEHDVHA AKGMVCVDCH KTKNHKIPTG YDPNNWAHDG
VRLSCTDCHT AKPHKDEDYN RHTARIACQT CHIPRTGGAF AKDFTKWEQL SNKFYEPTTL
KKEANETAPV YAWYNLTVAN RPDFIGPKGD RKDGKSKIYP FKIFQGKAYF NKKDGQLLSM
DFAPPMATGD TLAGVASAAK ILGIKDYEPV PGWQTIYFGS NHQVAPKEKA LTCYNCHAPN
GILNFRELGY SSDEVKKLTS PELYFEKIAE KMREEW
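
The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction that could be compared with the GC content field. Both function names are hypothetical, and only the first line of the gene sequence is used as sample input, so the printed fraction describes that line alone and not the whole gene.

```python
# Illustrative helpers (hypothetical names, not part of the source record).

def clean_sequence(block: str) -> str:
    """Remove the grouping spaces and line breaks from a printed sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGAAGAATA CGACGTTCAT CGCCACCCTC ATGGCGAGCG TGGCAGTGGC GGCCCTGGTA"
seq = clean_sequence(first_line)
print(len(seq))          # six groups of ten bases
print(gc_fraction(seq))  # GC fraction of this one line only
```

Passing the full gene sequence block through `clean_sequence` should give a string whose length can be checked against the Gene Length field.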