Gene Ssed_2784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsed_2784 
Symbol 
ID5612812 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sediminis HAW-EB3 
KingdomBacteria 
Replicon accessionNC_009831 
Strand
Start bp3357680 
End bp3358990 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content49% 
IMG OID640933703 
Producthypothetical protein 
Protein accessionYP_001474519 
Protein GI157375919 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2706] 3-carboxymuconate cyclase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.562572 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCATTGA TAAAGTTAAT ACAGCGCCTC TTGCCCTTGC TGGTGTGCGC CTCACTTTTC 
GGTTGCTCCC TCACTTCTCA TAGTACAAGT TCAGATAGCA AGCTCACTGC GACAATTCCT
ACGACAATGG TGCAGGTATT GATTGATAAC GAAGCGTGCA TCGATGGCTT AGATAACCCC
AGAGCGGTAA AAATCTCCCC CGATGGCACT CACGCTGTCG TTGCCAGCGG TGACGATAAT
TCACTGGCTA TCTTCGATAT AGATGATGAT TTCACCTTGA GTTTTAATCG GGTATTTAGA
AACAATAGCT ATGGCGTTAC CGGTCTGGAG GGCGCTTCGC AGGTGGCTTT TCTTCCCAGT
GGTAATAAGA TGTTTACCGT GAGCTTTTAT GACAGTGCGC TGGTGGTCTT CGAGCGCGGT
GAAGAACATC AATATAGGTT CAAGCGCAGG CTGAGTGACG GGCTGAGTAT TGAACGCATC
TTTAAGGACC CGGAGCCCTT TGGGGCTATA GATACACTAG GTTTACTAGG GGCTTGGGAT
GTCGTTGTTT CCGGCGATGG CCAACAGCTA TTTATTGCCA GTTATAAGAG TAATGCGCTC
TCGGTATTTG ACGTATCAGG TGACAAGGTG GTTCCTAATC GAGTCGAGGG AGGGGAGTCG
GGGTTAGGTG GCGCCGTAGC GCTCGCCCTG TCAGCCGATA ATACTCTACT GGCTATTACA
GGCTTTGATG AACATATGCT GACACTATAT AACCGCACAA TGGATGGTGA GTTAGAGTTA
AGTCAGACCT TGAGAGGAGG GGATGTGGGG ATCCCTCAGC TGGTTAACCC ACAAGCCGTT
AGATTTTCGC CCGATGGTCA TTTTATCTAT GTCGCCTGTT CGGGAAGTGA CGCTATTGTG
GTACTTAATC GAATTGAAGA AGTGTCCGGT TCTGGGACCG GGAAAGGGGA AGATGCGGCA
AAGTATGCCC TGTTCCAAAT CCTGACCAAC GATCAACTCG GCGGTGGCTT GAAAGGTGCC
GGCAGTTTAG CTGTTTCACC GGATGGACAG AGCGTTTTCG CTGCCGGTGA GGCTGATAGC
GGTGTACTCG TGTTGAGAAA AGAAAATGAC GGTCGGTTAA GCCTTGAATC AAAACTCTTT
GATACTGAGT TGGTAAATAA AGGCTTACCC GCCGGTGACA ACAAAGACAT TAATCTCCTG
GTAGGAGTCT CGTCACTTCA GCTCTCAAAA GATGGTCAAT ACTTGCTGGT GACGGCGGCA
AAGCAAGATT CACTTTATGT ACTTAAGCTA AAGGCTAGCG TATCAAAATA G
 
Protein sequence
MPLIKLIQRL LPLLVCASLF GCSLTSHSTS SDSKLTATIP TTMVQVLIDN EACIDGLDNP 
RAVKISPDGT HAVVASGDDN SLAIFDIDDD FTLSFNRVFR NNSYGVTGLE GASQVAFLPS
GNKMFTVSFY DSALVVFERG EEHQYRFKRR LSDGLSIERI FKDPEPFGAI DTLGLLGAWD
VVVSGDGQQL FIASYKSNAL SVFDVSGDKV VPNRVEGGES GLGGAVALAL SADNTLLAIT
GFDEHMLTLY NRTMDGELEL SQTLRGGDVG IPQLVNPQAV RFSPDGHFIY VACSGSDAIV
VLNRIEEVSG SGTGKGEDAA KYALFQILTN DQLGGGLKGA GSLAVSPDGQ SVFAAGEADS
GVLVLRKEND GRLSLESKLF DTELVNKGLP AGDNKDINLL VGVSSLQLSK DGQYLLVTAA
KQDSLYVLKL KASVSK