Gene GSU3271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU3271 
Symbol 
ID2687666 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp3588092 
End bp3589501 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content63% 
IMG OID637127963 
Producthypothetical protein 
Protein accessionNP_954311 
Protein GI39998360 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3659] Carbohydrate-selective porin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0437682 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGAACA TGAAGAAGGT GTTGGTGGTG GGTCTCGCGC TGCTCGGGAG TCTGGTCGTC 
GGCTCCACGG GAGCCCAGGC GCGCAATCCG GAATTTGCTC TTCCCGAGAA GGTCGAGGTA
AAGCACAAGG CCTGCCAGGA GATCTTACGG CTTGCGGCCA CGTATCAGGT GGAGGGGTTG
TTCTCAAAGG AATTCGAGGC GGGCCAGGTG TGCTACACCC GCACGGACCT GGCGGTGGTG
CTGGAGCTGC TGACCGAGAA ACTGGCGGAG AAAGTGGTGA AGGAAGGCTC GGCCGCCGTG
GCCAAGGAAG ACCTGGTGCT CCTGGCGGAG CTCCAGGACG AACTGCGCGG GGAAATGCTC
CTGGCACGCA CCCGTACCTT CCAGCAGCGC CGGGAGGGTC TCGGCACCAG GCTCACCGCC
ATCACCAAGA ACATCTCCCT CAGCGGGGGA CTGGTGGGGG TCCTCCAGGG CTCCATCGGC
AATGAGCCCT CCGACCACGT GGACGTGGTC GGCAGGGGAG ACCTGGTCTT CAGTTTCAAG
GTGGGTGAGA ACACCATCGC GGTGATCGAC GTGGAGGCGA CCGGCGGCAA CGGCATCGAC
ACCAGGGTGC CCAGCTTCTC GCTGCTGAAC GCCGTGGCCG GCAGCACCGG TGATACGGTC
CGCTTCCGCG AAGCATGGGT AGAGCATGCG GCGTTCGATG AACGCCTGAT CCTAACCGCC
GGCAAGATCG ACCTGACCAA CTACTTCGAC GCCAACGGCG TGGCAAACGA CGAGAACAGC
CAGTTTCTGG CCGGAGCCTT CGTGAACTCG GCGGTTCTGG GTGCCCCCGG CAACGGGCCG
GGAGCCAGGC TCCAGGCAAA GCTGGGCGAG CCGCTCACCT TCGGCCTCGG CTACGGCAGC
GGCGACACCG ACACGGAGGA TGTGTTCTCG CACGGCTACG GAATAGCCGA GCTGGACTAC
ACCCTCAAGG TAGGTGAACT GGAAGGGAAC TACCGCGTCT ATGGCAGCCT GGACGGAGCG
CGTGCCGATG GCGAGGTGAA GCTGCAGGAA AAAAACGCCT TCGGGTTCGG GATCAGCGCT
GACCAGCAGG TGACCGACAA GCTGACCCTG TTCGTGCGCT ATGGCCAGCG TGACGAGGAC
GTCTATGCCA CGAAGATGGC CTGGAGCGCC GGCGGACAGT ACGCCGGGCT GCTTCCCGAG
CGCAAGGATG ATGTCCTCGG CTTTGCCTAC GGACAGGTGA AGGCCGTGGG TGCTGACTCA
CAGGAAAAAC TGGCGGAGCT CTACTACAAG GTCCAGGTGA ACGAGCAGAT CAGCATCGCG
CCTGTGGTTC AGTACCTGAT CAATCCTCTG GGGGACAGCA GCAGGGATGA CGTGGTGGCA
CTGGGGCTGC GCTCACTGAT CAGTTTTTAA
 
Protein sequence
MRNMKKVLVV GLALLGSLVV GSTGAQARNP EFALPEKVEV KHKACQEILR LAATYQVEGL 
FSKEFEAGQV CYTRTDLAVV LELLTEKLAE KVVKEGSAAV AKEDLVLLAE LQDELRGEML
LARTRTFQQR REGLGTRLTA ITKNISLSGG LVGVLQGSIG NEPSDHVDVV GRGDLVFSFK
VGENTIAVID VEATGGNGID TRVPSFSLLN AVAGSTGDTV RFREAWVEHA AFDERLILTA
GKIDLTNYFD ANGVANDENS QFLAGAFVNS AVLGAPGNGP GARLQAKLGE PLTFGLGYGS
GDTDTEDVFS HGYGIAELDY TLKVGELEGN YRVYGSLDGA RADGEVKLQE KNAFGFGISA
DQQVTDKLTL FVRYGQRDED VYATKMAWSA GGQYAGLLPE RKDDVLGFAY GQVKAVGADS
QEKLAELYYK VQVNEQISIA PVVQYLINPL GDSSRDDVVA LGLRSLISF