Gene GSU1010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1010 
Symbol 
ID2687460 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1091283 
End bp1092803 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content60% 
IMG OID637125680 
ProductSlt family transglycosylase 
Protein accessionNP_952064 
Protein GI39996113 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0741] Soluble lytic murein transglycosylase and related regulatory proteins (some contain LysM/invasin domains) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCGAT CTTTTTTCCT GCTGGCCATA GTCCTGCTGT TTACACCCAT TGCCCAGGCA 
TCCGATCTTC ACTTGAACCC CCTGCGGGAG TTGACCTCCC TGGGGAGCGG TTCGCTGCAG
GCCGATCTGT CGGGAGTTGT TCCCCGCGTG GGATCGGCGC GGCCAAAGGC TGCCTCCCCT
GCCGAGGGGC GCCGGGCGAA TCCGCTGGAG CCGTCCATGG GAGAGCTCAT CGTCGTGGAG
GACGAGACAT CCCTTGAGGA TGATTTCGAA CTGCAGCTCC CCGGCCAGGA CCTCCCCGAA
TCGGACATCC CGCTTGCCCT CAACGGCAAG GTAGAGTACT TCATCTCCTA TTTCCAGACC
TCCGGCCGCA AGTCCTTTTC CCGCTGGCTT TCCCGCTCCG AACGATACAT TCCCATGATG
CGCGAGGTTC TCAGGAAGGA AGGACTTCCC GAGGATCTGG TCTACCTGGC CATGATCGAG
AGCGGCTTTA CGCCCCATGC GGTTTCCGTG GCGAGCGCCG TGGGCCCCTG GCAGTTCATC
TCGGGCACGG GAAAACGTTA CGACCTGAGG ATCGACCAAT GGATCGACGA ACGGCGCGAT
CCGCTCAAGT CGACCGTTGC CGCCGCCATG TACCTGAAGG AGCTTTACTC CCTCTTCAAT
CAGGATTGGT ATCTGGCTGC GGCAGGCTAT AACGCCGGCG AGAACAAGAT CCTGCGCGCC
ATCGACAAAT ACAACACGCG GGACTTCTGG GAAATATCCA AGGGCTCGTA TTTGAAGAGG
GAGACCAAGG ATTACGTGCC GAAGCTCCTG GCCGCCGCCA TCATCGCAAA GGAGCCGGCC
CGCTACGGCT TCGCCGATGT GGCGTATCTT CCCCCCATCG AGTTCGACTT AGTTGCCATT
CCTTCGCGCA CCGATCTGGA CCTGGTGGCC AAACTCTGCG AGGTGGATGT CAAGGCCATC
AAGGAATTGA ACCCGGAACT GCGCCGCTGG TGCACACCTC CCGACTACCC CGACTACGAG
CTCAAAATCC CCAAGGGAAA GCGCACGTCC TTCGAGGAGG CATACGCCCA TCTCCCCGCG
GACCAGCGCT ACGTCGAGCG GATTGTCTAC AGCCGCTACC GGGTTAAGAA AAAGGATACC
CTGCAGGCGA TCGCGCGACG CTACGGCACC ACTGCCGAGA CCCTGGCCGA GGTTAACAAA
CTGAAGCCGA CCTCGAAGCT CCGGGGCCGC ACCCTGCTGG TGCCGGTGCC GGTCGCGACG
GAGGATGCCG CGGAAAGGAC CGTCGCCAAG GCGTCGCCGA AGAAGGACGA GTCCCGCGCA
TTCAACAAGT ACTACACGGT CAAGAAAGGC GACACCGTCG CCTCGCTGTC CAAGAAATTC
AACATTTCCC AACGGATTCT GGCAGCATGG AATAATTTGA AGGGCAAAAT GGCCCTTCAC
CCCGGCAAGC GGATCATCGT CGCCAAGTAT GTGGAGAAAA AAGGGTCGAT GGTGCCGGTC
GACGGCGGGG AGAACAGCTA G
 
Protein sequence
MNRSFFLLAI VLLFTPIAQA SDLHLNPLRE LTSLGSGSLQ ADLSGVVPRV GSARPKAASP 
AEGRRANPLE PSMGELIVVE DETSLEDDFE LQLPGQDLPE SDIPLALNGK VEYFISYFQT
SGRKSFSRWL SRSERYIPMM REVLRKEGLP EDLVYLAMIE SGFTPHAVSV ASAVGPWQFI
SGTGKRYDLR IDQWIDERRD PLKSTVAAAM YLKELYSLFN QDWYLAAAGY NAGENKILRA
IDKYNTRDFW EISKGSYLKR ETKDYVPKLL AAAIIAKEPA RYGFADVAYL PPIEFDLVAI
PSRTDLDLVA KLCEVDVKAI KELNPELRRW CTPPDYPDYE LKIPKGKRTS FEEAYAHLPA
DQRYVERIVY SRYRVKKKDT LQAIARRYGT TAETLAEVNK LKPTSKLRGR TLLVPVPVAT
EDAAERTVAK ASPKKDESRA FNKYYTVKKG DTVASLSKKF NISQRILAAW NNLKGKMALH
PGKRIIVAKY VEKKGSMVPV DGGENS