Gene Ssed_2359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsed_2359 
Symbol 
ID5612970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sediminis HAW-EB3 
KingdomBacteria 
Replicon accessionNC_009831 
Strand
Start bp2872233 
End bp2873642 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content47% 
IMG OID640933270 
Productpara-aminobenzoate synthase, subunit I 
Protein accessionYP_001474096 
Protein GI157375496 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00205255 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000900079 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTTTTT CGCCGCAGCA GCAACTTGCT TTCAGGCATC TGAATTGGAA ATACACCACC 
ACAGAACTAT TCTCCCACCT CGCAGACAAG CCCTGGGCCA TGCTGCTCGA CTCGGCAGAT
GCTGCTCACC TGGATGCAAA ATTTGACATC ATAGTGTGCG ATCCTATCGC GACGATTGTA
ACCGATGGCC AATCGAGCAG GGTGAATCAT CTACAAGACG GCGAAATAGC AGATAGTCAA
GTTCACAGTG GTGACCCCTT CACGCTTCTC AATGACACGA TAAATCACTA TTTCCCTCAT
CAATATCCAA GCCCCTTACC TTTTAGCGGA GGCGCTGTGG GCTGTTTCAG TTACGATTTG
GGTCGCCAGA TTGAACATCT CCCCCAAATA GCGGCCAGAG ACATTTTACT GCCCGAAATG
AATGTTGGCT TGTATCCCTG GGCATTAATT TTCGATCGCT TAAACGCGTG CTGGACTCTG
GCTCATTACC ATGGAGAGGC TCCACTTGAG TCGACTCTGG CACAACTTAA TACCCTGCTC
GAAGCTAAAC CCAATTCGGT TACCGGTGAC TTCTCCTTAA CCAGTCAATG GATTAATCAG
ATAACTAAAT CACAATACAT TGAAAAATTT GATAAAATCC AATCCTACCT CAATAGTGGT
GATTGCTATC AGATAAATCT GACTCAGCGT TTTACCGCAA GCTATCGAGG CGATGAATGG
CGCGCATACC TCAAGCTGCG TGAGACAAAC AGAGCCCCCT TTTCGGCATT TATCCGATTA
GATGATGCGG CAATACTCTC TATCTCACCT GAGCGATTTA TTCAGCTTCG TGATGGTCAA
GTGCAAACAA AGCCCATAAA AGGAACACGG CCTCGATTTG AAAATGCAGA AGCAGACACC
TCTTCTGCAC TCGAACTCGC CGAATCAGAG AAAGATCGCG CCGAAAACTT AATGATTGTC
GATCTACTAC GAAATGACAT AGGCAAAGTT GCAAAAGCAG GCTCAGTCAA GGTTCCTCAC
CTTTTCCAAA TTGACAGTTT TCCAGCCGTC CACCACTTAG TCAGTACGGT AACGGCCGAG
TTACACAACA AATATCAAGC AACCGACCTG TTAAAAGCGG CTTTTCCCGG TGGCTCTATT
ACCGGTGCCC CAAAAATCCG TGCGATGCAG ATCATAGAAG AGCTTGAACC CTCGAGGCGC
AGCCTATATT GTGGGTCTAT TGGCTATATC AGCCAAGATG GACAGATGGA CACCAGTATT
ACTATTCGCA CATTAGTCGC ACAAGCCAAT CACATACACT GCTGGGCCGG CGGCGGGATT
GTCGCCGACT CCCAAGCTAA TGATGAATAT CAGGAAACCT TTGATAAGGT CAGTAAGATA
CTTCCTGTTC TTGAAAAGGT GGATTCTTAA
 
Protein sequence
MSFSPQQQLA FRHLNWKYTT TELFSHLADK PWAMLLDSAD AAHLDAKFDI IVCDPIATIV 
TDGQSSRVNH LQDGEIADSQ VHSGDPFTLL NDTINHYFPH QYPSPLPFSG GAVGCFSYDL
GRQIEHLPQI AARDILLPEM NVGLYPWALI FDRLNACWTL AHYHGEAPLE STLAQLNTLL
EAKPNSVTGD FSLTSQWINQ ITKSQYIEKF DKIQSYLNSG DCYQINLTQR FTASYRGDEW
RAYLKLRETN RAPFSAFIRL DDAAILSISP ERFIQLRDGQ VQTKPIKGTR PRFENAEADT
SSALELAESE KDRAENLMIV DLLRNDIGKV AKAGSVKVPH LFQIDSFPAV HHLVSTVTAE
LHNKYQATDL LKAAFPGGSI TGAPKIRAMQ IIEELEPSRR SLYCGSIGYI SQDGQMDTSI
TIRTLVAQAN HIHCWAGGGI VADSQANDEY QETFDKVSKI LPVLEKVDS