Gene Sputcn32_2039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSputcn32_2039 
Symbol 
ID5079465 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella putrefaciens CN-32 
KingdomBacteria 
Replicon accessionNC_009438 
Strand
Start bp2324315 
End bp2325475 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content43% 
IMG OID640499201 
Producttetratricopeptide repeat protein 
Protein accessionYP_001183559 
Protein GI146293135 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2956] Predicted N-acetylglucosaminyl transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000051 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTGAGA TCCTCTTTCT GTTGCTTCCC ATTGCTGCCG GTTACGGTTG GTATATGGGT 
CGGCGGAGCA TAAGGCAGAA TCAAAGTAAT CAGCGCAAGC AATTAAGTCG TGATTATTTC
ACGGGCCTTA ATTTCTTATT GTCCAATGAG TCGGATAAAG CGGTTGATTT ATTTATCAGT
ATGTTAGATG TAGACGATGA GACTATCGAC ACTCATTTGT CCTTAGGTTC ACTTTTCCGT
AAACGTGGCG AAGTTGATCG CTCCATTCGT ATTCACCAAA ATTTGATTGC TAGGCCTACA
CTCACCAATG AGCAGCGCGA CATTGCTATG ATGGAGCTGG GTAAAGATTA TCTGGCGGCA
GGATTTTATG ATAGGGCTGA AGAAATCTTT ATTAATCTTG TAAGCCAAGA TGATCACAGT
GAAGAATCTG AAACACAATT AATTGATATT TATCAAGTGA TTAAAGAGTG GCAAAAAGCT
ATTGATATCA CAAAACGTTT AAGCCGCAAG CGCCAGCAAG TGCTTAAACC TATCATTGCT
CACTTCTATT GTCAGTTGGC CGATGAAACC AATGATGACG ACAAAAAAAT TAAGCTATTA
CAACAGGCAC TTAAACAGGA TCCTAAGTGT GGCCGGGCCT TACTCACACT TGCAAAAAAA
TTCCTCGATA TTCAAGATTA CCCACAATGT AAAGCCATGC TAGCGGCGCT CAAAAAAGCG
GATATTGAAC TCTTTGCTGA TGCATTGCCG ACAGCGAAGC AAGTTTACCG TGATACCCAA
GATAAAGAGG GCTATCAGGA ATTACTTGCA GGGGCTATGG CAGAAGGCGC TGGAGCCTCT
GTGGTTGTCG CTCTCGCACA GCATTTGATC AGCCTTGATG AAATTAAAGC GGCTGAAAAT
ATAGTGCTTG ATTCGCTATA TCGCCATCCC ACCATGAAAG GCTTTCAACA CCTAATGCAG
ATGCACCTAC GTCAAGCTGA AGACGGACAA GCTAAGCAAA GTTTAGCTAT GTTAGAGCAA
CTTGTTGAAC AACAAATAAA ATTTCGCCCT AGTTACCGTT GTAAGGAGTG TGGTTTTCCA
TCACATGCGC TTTACTGGCA TTGCCCATCT TGTAAAAATT GGGGCAGTAT AAAACGGATC
AGAGGGCTTG ACGGCGAATA A
 
Protein sequence
MLEILFLLLP IAAGYGWYMG RRSIRQNQSN QRKQLSRDYF TGLNFLLSNE SDKAVDLFIS 
MLDVDDETID THLSLGSLFR KRGEVDRSIR IHQNLIARPT LTNEQRDIAM MELGKDYLAA
GFYDRAEEIF INLVSQDDHS EESETQLIDI YQVIKEWQKA IDITKRLSRK RQQVLKPIIA
HFYCQLADET NDDDKKIKLL QQALKQDPKC GRALLTLAKK FLDIQDYPQC KAMLAALKKA
DIELFADALP TAKQVYRDTQ DKEGYQELLA GAMAEGAGAS VVVALAQHLI SLDEIKAAEN
IVLDSLYRHP TMKGFQHLMQ MHLRQAEDGQ AKQSLAMLEQ LVEQQIKFRP SYRCKECGFP
SHALYWHCPS CKNWGSIKRI RGLDGE