Gene GSU0435 details


Gene Information

Locus tag: GSU0435
Symbol:
ID: 2686392
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 466122
End bp: 467771
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 66%
IMG OID: 637125100
Product: MSHA biogenesis protein MshE, putative
Protein accession: NP_951494
Protein GI: 39995543
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB
TIGRFAM ID:
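
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 466122
END_BP = 467771
GENE_LENGTH_BP = 1650
PROTEIN_LENGTH_AA = 549


def span_length(start: int, end: int) -> int:
    """Length of a span given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)               # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Both checks pass for this record: 467771 - 466122 + 1 = 1650 bp, and 1650 / 3 - 1 = 549 aa.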


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAGCA TCGTCAAGGA AGGGTCTCTC GGGTCCATTC TCTTCAAATG CCAGATCATC 
AGCGAGGACG ACATCCGGCG GGCACTCGAT GAGCAGGAGC GCACCGGAGG CCGCTTCGGC
GAGGCACTGG TATCACTCGG CATTGTAACC CAGGAAGACA TCGACTGGGC CCTGTCGAAC
CAGCTCAACA TCCCCTACGT ACGCCTCAAG CCGGCCATGG TCGACCGGGA TGCCGTGGCG
TTGGTCCCGG CGGTCATGGC CCGACAGCAT AATCTCATCC CCCTGATCAG GGCCGGCGAG
GAACTGAGCA TCGCCATTGC CGACCCGCTC AACGTGGCGG CAGTGGCGGC CGTGGAAAAG
GAGACCGGGT GCGCCGTGTC GGTTTCCGTG GCGCTCATCC GGGAGATCCG GGAGATGCAG
GAGCGTTTCT ACGGACCGCC CGACACGGAG GAACGCCTGG GGTTCACGTC GTCGGCGTTC
CCACCCCAGG CCCTTGCCGC CATGAATCAC GACCTGACAG GGGGGAAGTT CATCGATTAC
CTGTTGCTGT TCGTGGCCCA GCAAAAGCTC AGCTCCCTTT CGCTCCACCC CCTGGGGGAC
AGGGTGTCGG TGATTGGCCG GCGCGGCGGC ACCACGCGGG AGGTCGGACA GCTTGCCCCT
TCCCGCTATC CCGACGTGGT CATGCACGTC AAGAAACTTG CCCACATCGA CGGGGCCCGG
TTCTCCGCCC GTGGGGGGCT ATCCTTTGCC CTGAAGGGCC GCTCCATTCC CTTTCAGGTG
GCTACCCTGC GGGGAGAAGG CGGCGATCAC CTCACCTTCA GGATGACGGT GGCGGCATTG
TTCCCGACAT CCCTTGCCGA CCTGGGGCTG ACGGACGACC AAGTCCGGCA GTTTGCCGAT
CTGGCGGCGG CCGGTCGCGG CATGGTGGTG ACCGGAGCCC GGGATCGGGA GATTCGCCGC
CGGCTCACGG ATCTCTACCT TCAGGAGCAT GAGGCGGAGG GGAAGACCGT GCTGGTCGTC
GGCAGCGGCG CCGGCACGGG AGAGCAGCGG TTCCCCCGTA TTCCGGTGCC GTCCGACGCG
GATCTGAGCG CTGTGGTTTC AGCCTGTCTG GAGCATGATC CGGATATTCT CGTCCTGGAG
GATGTCACTG ACGGCCAGGC CTTTGCGGCA GCATGCCGGG CAACTCTGCG CGGTAAGCTG
GTGGTGGCGG GGATCGGCTG CGGTGACGCC GTCGGCGCCC TGGACCAGCT TATCGCCTTC
CGGGACATGC ACGTTCTCGT GCCTGCGTAT CTGCGGGGGG TGATTACCTG CACGCCGATC
CGGCCCCTGT GCCCCGCATG CCGGCGAAGC GAGCCATTCC CCGCCGCAGA GCGGGCGGCT
CTGGGGATCG GTGCCGACGT CACCTCTTGC TGGCGGTCGG CGGGATGCGA ATCCTGCGAC
CAGACCGGCC ATGACGGCAG GCGCTACCTG CTGGATGTGC TGGTTTTGGA CCACGACCTC
CGGGAGCGGT TCGAGGCGGC CCGCAACGGG GCAGAGGTGA TCGAGCATTT GCGCGGACAG
GGTTGGCGCG GGATCACGGA CGAGCGGCAG ACCCTGCTGG CTGAAGGCAC TATTTCGCTG
GAGGAGTACG CCTCCTCCCT GCACGGTTGA
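
The GC content reported in the record can be recomputed from the gene sequence once the 10-base grouping is removed. This is a minimal sketch: the helper names `clean_sequence` and `gc_percent` are hypothetical, and the example runs on the first sequence line only, so its result need not match the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGGAAAGCA TCGTCAAGGA AGGGTCTCTC GGGTCCATTC TCTTCAAATG CCAGATCATC"
seq = clean_sequence(first_line)
print(len(seq))         # 60
print(gc_percent(seq))  # GC percentage of this 60-base window
```

Passing the full gene sequence to `clean_sequence` and then to `gc_percent` gives the value to compare against the GC content field.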
 
Protein sequence
MESIVKEGSL GSILFKCQII SEDDIRRALD EQERTGGRFG EALVSLGIVT QEDIDWALSN 
QLNIPYVRLK PAMVDRDAVA LVPAVMARQH NLIPLIRAGE ELSIAIADPL NVAAVAAVEK
ETGCAVSVSV ALIREIREMQ ERFYGPPDTE ERLGFTSSAF PPQALAAMNH DLTGGKFIDY
LLLFVAQQKL SSLSLHPLGD RVSVIGRRGG TTREVGQLAP SRYPDVVMHV KKLAHIDGAR
FSARGGLSFA LKGRSIPFQV ATLRGEGGDH LTFRMTVAAL FPTSLADLGL TDDQVRQFAD
LAAAGRGMVV TGARDREIRR RLTDLYLQEH EAEGKTVLVV GSGAGTGEQR FPRIPVPSDA
DLSAVVSACL EHDPDILVLE DVTDGQAFAA ACRATLRGKL VVAGIGCGDA VGALDQLIAF
RDMHVLVPAY LRGVITCTPI RPLCPACRRS EPFPAAERAA LGIGADVTSC WRSAGCESCD
QTGHDGRRYL LDVLVLDHDL RERFEAARNG AEVIEHLRGQ GWRGITDERQ TLLAEGTISL
EEYASSLHG
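
The protein sequence can be checked against the gene sequence by translating the CDS codon by codon. The sketch below builds the codon table from the NCBI base ordering (T, C, A, G). It assumes that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, which does not matter here because the gene begins with ATG. The `translate` helper is illustrative.

```python
from itertools import product

# Codon table in NCBI order: first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; whitespace is ignored, trailing stops are dropped."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein.rstrip("*")


# First 30 bases and last 30 bases of the gene sequence above.
print(translate("ATGGAAAGCA TCGTCAAGGA AGGGTCTCTC"))  # MESIVKEGSL
print(translate("GAGGAGTACG CCTCCTCCCT GCACGGTTGA"))  # EEYASSLHG
```

Both fragments reproduce the start and end of the listed protein sequence. Translating the full gene sequence the same way gives a string that can be compared against the complete protein after removing its spacing.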