Gene Ssed_1686 details

Gene Information

Locus tag: Ssed_1686
Symbol:
ID: 5610070
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sediminis HAW-EB3
Kingdom: Bacteria
Replicon accession: NC_009831
Strand:
Start bp: 2020106
End bp: 2021665
Gene Length: 1560 bp
Protein Length: 519 aa
Translation table: 11
GC content: 50%
IMG OID: 640932556
Product: anthranilate synthase component I
Protein accession: YP_001473425
Protein GI: 157374825
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00565] anthranilate synthase component I, proteobacterial subset
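
The length fields in this record agree with its coordinates under two assumptions that the record itself does not state: that Start bp and End bp are 1-based inclusive positions, and that the protein length excludes the terminal stop codon. A minimal Python sketch of that consistency check follows; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the Start bp / End bp fields of this record.
length = gene_length_bp(2020106, 2021665)
print(length, protein_length_aa(length))  # prints: 1560 519
```

Both results match the Gene Length (1560 bp) and Protein Length (519 aa) fields.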


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGATAG CTCAAGTAAA GACGAGTAAA GAGAGGATCA CCTATCATGC CGATCCTCTC 
CGCTTGTATC AGCATCTTAC CCAAGACGCA CCACACACAA TGCTGCTCGA ATCGGCAGAG
ATAGATAGCA AAGACCATCT TAAAAGCATT GTGATGACAC ATGCAGCCAT GATGATCCGT
TGCGATAGTT ATCGACTGAC CTTCACCGCC TTGAGCGAAA ATGGTCACGA TCTTCTCTCT
GGCATTCAAC ACTTCTTCCA TGAGGCAGAC TCGAACTTAG AGGCTGGCAT ATTGACGCTC
ACCCTGAAAA AAGAGGTCGC TCAACTCGAT GAAGATGCCA GACTAAAATC CACCTCCCCT
CTCGATGGTC TCAGGGCATT AATCAAGGAG ATAGATTGCG GCGACTCTCC AGCCTTTGAG
GACCTATTCT TAGGTGGTGT GCTGGCTTAT GACCTCATCG ATACCGTAGA GCCGCTTCCC
CAGGTTCCTC AAGGTGATAA CACATGCCCC GATTACCTGT TTTATCTCGC TGAAACGCTA
ATTTTAATCG ATCACCAAGA GCAACAAGCC GATATAGTCA CTCATCAATT TAGCACCGTT
GAAAAGCTCC CCTCTTCACC ATATTCCGCG GTGCTGGAAC AGAGAGTCCA ATTACTGCAA
ACGATGTCAG TTCAAGAGAC TAAGGTTGAT GAGTTAGTCA CACTGGATGT ACAGACTCAG
GTCAACATCA GCGATGACTC ATTTAAAGCA ACGGTTAATG AACTGAAATC ACATATTGTC
GCCGGGGACA TCTTTCAGGT GGTTCCATCG CGCAGCTTTA GCCTGCCCTG CCCAAGTACA
TTAGGAGCAT ATCGCGCGCT TAGGCAGACC AATCCAAGCC CCTATATGTT CTATTTCAGA
GGCGAAGACT TCACCCTGTT TGGGGCTTCT CCTGAAAGCG CACTCAAATA TGAGGCTCAC
ACCAATCAAG TCGAGATCTA CCCCATTGCC GGGACCCGCA AGCGTGGAAA ATCGGCCGAT
GGTGAGATAG ATTTCGATCT GGATAGCCGA ATAGAGTTAG AGCTTCGCCT GGACAAAAAA
GAGTTATCGG AACACCTCAT GTTGGTCGAC CTAGCGCGTA ATGATGTGGC TCGGATCAGT
CAAAGTGGCT CGCGAAAAGT GGCTGAGCTG TTAAAGGTCG ACCGTTACTC ACATGTCATG
CATTTAGTCA GTCGCGTTAC CGGCCAACTT CGAGACGATC TCGATGCGCT GCACGCGTAC
CAGGCCTGCA TGAACATGGG CACACTGACC GGAGCACCAA AGGTCAGTGC AGCCCAGTTG
ATCCGCGTCG CCGAAAAAAC TCGCCGTGGC AGCTATGGCG GAGCGGTAGG CTACCTTAAT
GGCCTCGGAG ATATGGATAC CTGTATCGTC ATTCGCTCTG CATTTGTCAA AGATGGTACC
GCCTACATTC AGGCAGGTGC AGGCGTCGTA TTTGATTCAG ATCCACAAGC CGAAGCCGAT
GAAACTCGCC AAAAAGCGCA GGCCGTCATA TCAGCAATCA AACTAGGAGG CGGACAATGA
 
Protein sequence
MKIAQVKTSK ERITYHADPL RLYQHLTQDA PHTMLLESAE IDSKDHLKSI VMTHAAMMIR 
CDSYRLTFTA LSENGHDLLS GIQHFFHEAD SNLEAGILTL TLKKEVAQLD EDARLKSTSP
LDGLRALIKE IDCGDSPAFE DLFLGGVLAY DLIDTVEPLP QVPQGDNTCP DYLFYLAETL
ILIDHQEQQA DIVTHQFSTV EKLPSSPYSA VLEQRVQLLQ TMSVQETKVD ELVTLDVQTQ
VNISDDSFKA TVNELKSHIV AGDIFQVVPS RSFSLPCPST LGAYRALRQT NPSPYMFYFR
GEDFTLFGAS PESALKYEAH TNQVEIYPIA GTRKRGKSAD GEIDFDLDSR IELELRLDKK
ELSEHLMLVD LARNDVARIS QSGSRKVAEL LKVDRYSHVM HLVSRVTGQL RDDLDALHAY
QACMNMGTLT GAPKVSAAQL IRVAEKTRRG SYGGAVGYLN GLGDMDTCIV IRSAFVKDGT
AYIQAGAGVV FDSDPQAEAD ETRQKAQAVI SAIKLGGGQ
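
Both sequences are displayed in space-separated blocks of ten letters, so the spacing has to be stripped before any analysis. The Python sketch below shows one way to do that and to recompute simple properties such as GC fraction. It assumes the input contains only A, C, G and T; the helper names are hypothetical, and the demonstration uses a short toy string rather than the sequence from this record.

```python
# Stop codons of NCBI translation table 11 (the table listed for this gene).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for the 10-letter display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def ends_with_stop(dna: str) -> bool:
    """True if the sequence is a whole number of codons and ends in a stop codon."""
    return len(dna) % 3 == 0 and dna[-3:] in STOP_CODONS


# Toy input for illustration only, not the gene sequence above.
toy = clean_sequence("ATGGCC GGATGA \n")
print(toy, len(toy), ends_with_stop(toy))  # prints: ATGGCCGGATGA 12 True
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` would allow the GC content field to be checked against the displayed sequence.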