Gene Sbal223_3641 details


Gene Information

Locus tag: Sbal223_3641
Symbol:
ID: 7089575
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS223
Kingdom: Bacteria
Replicon accession: NC_011663
Strand:
Start bp: 4319891
End bp: 4321381
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 47%
IMG OID: 643462521
Product: ferredoxin-dependent glutamate synthase
Protein accession: YP_002359542
Protein GI: 217974791
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0069] Glutamate synthase domain 2
TIGRFAM ID:
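
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both are assumptions made for illustration, not statements from the record itself, and the variable names are hypothetical.

```python
# Values copied from the Gene Information fields; names are illustrative only.
start_bp = 4319891
end_bp = 4321381

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: an unspliced CDS is read in 3 bp codons and the last codon
# is a stop codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (1491 bp) and Protein Length (496 aa) fields listed in the record.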


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.906462
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.15864
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGAAC TCAATTGGTT TATGTGGGGA CTCGATCTGT TTTCAGGACT GTTTTTGATC 
GTTATAGGAC TCGCAGTACT CGCGGTTATT TACATGTATA TCGCAGATAA AATGCAGACC
AAACAAGCGG TAAGACATAA CTATCCCGTG ATTGGTCGTT TTCGATATCT GTTCGAAAAA
CAAGGTGAGT TTTTCAGACA ATACTTTTTT GCCCAAGACA GGGAAGAATT GCCCTTCAAC
CGTGCTGAAC GCAGCTGGGT GTACCGCGCC GCCAAAAATG TCGATAGAAC CATAGCCTTT
GGTTCGACTC GTCCTTTGGA TACAGCTGGT ACTATTATGT TTATGAATAC CGCCTTCCCA
ACCCAAGATG AAGATATCAC TCCTATTCAT CCGCTGACGA TTGGCACTCA CTGTCGTCAA
CCTTATACGA CTCAAGCCAT TTGCCATATT TCAGCCATGA GTTTTGGCGC CCTATCGCGC
CCAGCCATCA CCGCCCTTTC CCATGGCGCA GCGCAAGCGG GTTGCTGGCT TAATACGGGT
GAAGGCGGCT TGAGCCCTTA TCATCTCAAA GGTGGCTGCG ATCTCGTCTT TCAAATTGGA
ACAGCAAAAT ATGGCGTTCG CAATGAACAA GGACATTTAG ACGATGAAAA ACTGAAAGCG
ATAGCGATTC ATCCTGAAGT CAAAATGTTC GAAATCAAAA TGAGCCAAGG CGCAAAACCT
GGCAAAGGTG GCATTTTACC CGGTATTAAA GTGACCGCTG AGATTGCCCA TATTCGCGGT
ATTCCAGAGG GACACGACTC AATCAGCCCC AATGGCCACA TCGAATTTAA GTCAGTAAAC
GATATTTTAG ATATGGTTGA GCGAGTTCGT GAAGTCACAG GTAAACCTAC CGGCATTAAA
GCCGTACTCG GCGATGTGCA GTGGCTGGAA GATTTATGTG ATGAAATTGA ACGCCGCGGC
GAAGACTCCG CCCCCGACTT TTTCACCTTA GACAGCGCCG ATGGTGGCAC AGGCGCAGCA
CCGCAACCAT TAATGGATTA TGTCGGATTA CCACTGAAAG AAAGCTTACC TATCCTAGTC
AATATCTTGA TCCAACGCGG CTTGCGTAAA CGCATTAAAG TCATCGCCTC GGGCAAACTT
ATCGTCCCAT CCAGAGTCGC TTGGGCTTTG GCATTAGGCG CCGACTTTAT CGCATCGGCC
CGTGGCAACA TGTTCGCCCT CGGTTGTATT CAAGCCTTGC AGTGTAATAA AGATACCTGC
CCAACGGGTA TCACGACGCA CAATCCAAAA CTACAACAAG GGCTAAATCC TAGGGATAAG
TCGACTCGGG TCGCTAGCTA TAATCATAAT TTACACCATG ACTTAGGGCT GATCGCGCAC
TCTTGCGGTG TGACAGAGCC AAGACAGCTC AAGCCTTCAC ATGTACGAAT TGTGCTCGAT
AGCGGCTTAT CCATCTCACT CGACAAATTC TATTCGCACA TGGATAAATA G
 
Protein sequence
MSELNWFMWG LDLFSGLFLI VIGLAVLAVI YMYIADKMQT KQAVRHNYPV IGRFRYLFEK 
QGEFFRQYFF AQDREELPFN RAERSWVYRA AKNVDRTIAF GSTRPLDTAG TIMFMNTAFP
TQDEDITPIH PLTIGTHCRQ PYTTQAICHI SAMSFGALSR PAITALSHGA AQAGCWLNTG
EGGLSPYHLK GGCDLVFQIG TAKYGVRNEQ GHLDDEKLKA IAIHPEVKMF EIKMSQGAKP
GKGGILPGIK VTAEIAHIRG IPEGHDSISP NGHIEFKSVN DILDMVERVR EVTGKPTGIK
AVLGDVQWLE DLCDEIERRG EDSAPDFFTL DSADGGTGAA PQPLMDYVGL PLKESLPILV
NILIQRGLRK RIKVIASGKL IVPSRVAWAL ALGADFIASA RGNMFALGCI QALQCNKDTC
PTGITTHNPK LQQGLNPRDK STRVASYNHN LHHDLGLIAH SCGVTEPRQL KPSHVRIVLD
SGLSISLDKF YSHMDK