Gene Spro_2667 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_2667 
Symbol 
ID5605847 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp2936743 
End bp2938305 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content60% 
IMG OID640938206 
Productanthranilate synthase component I 
Protein accessionYP_001478896 
Protein GI157370907 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00565] anthranilate synthase component I, proteobacterial subset 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000050779 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATGAACA TCAAACCACA ACTCAAATTA CTGAAGGCGG AGGCCAGTTA CCGGGGCGAT 
CCGACCACCA TTTTCCACCA GCTGTGCGGC GCTCGTCCGG CCACCCTGCT GTTGGAATCG
GCTGAAATCA ACAGCAAGCA AAACCTGCAA AGCCTGTTGG TCATTGACAG CGCTCTGCGC
ATTACCGCAC TGGGCCGCAC CGTGACGCTG CATGCGCTGA CCGCCAACGG CGCGGCGATG
CTGCCGCTGC TGGATGAGGC CCTGCCGGCA GAAGTCCAGA ACCAGGTGCG TCCTAATGGA
CGTGAACTGA CTTTCCCGGT GATTGATGCG ATTCAGGATG AAGATGCCCG CCTGCGTTCG
CTGTCGGTGT TTGATGCACT GCGCACCCTG CTGACACTGG TTGACTCCCC GGCTGACGAA
CGTGAAGCAG TGATGCTCGG CGGACTCTTT GCCTACGACT TGGTCGCCGG GTTCGAAGAC
CTGCCGCCAC TGCGTCAGGA AAACCGCTGC CCGGACTTCT GCTTCTATCT GGCGGAAACC
TTGTTGGTGC TGGATCATCA ACGCAGCGTT GCCCGTCTTC AGGCCAGCGT TTTCACGGCC
GATACGGCGG AAGAACAGCG CTTGCAACAG CGCCTGGAGC AGTTGCAGCT GCAATTGAAA
CAGACCCCAC AGCCGATCCC GCACCAGAAG CTGGAAAACA TGCAACTGAG CTGTAACCAG
ACCGATGAAG AATACGGTGC GGTTGTCAGC GAATTGCAAC AGGCCATCCG TCAGGGCGAA
ATCTTCCAGG TGGTGCCGTC GCGCCGTTTC TCGCTGCCGT GCCCGGCCCC GTTGGCCGCT
TACCAGACGC TGAAGGACAA CAACCCAAGC CCATACATGT TCTATATGCA GGATGACGAG
TTCACCCTGT TCGGTGCTTC GCCGGAAAGC GCGCTGAAAT ACGACGCCGG CAACCGCCAG
ATCGAGATCT ACCCGATTGC CGGTACCCGT CCTCGCGGCC GTCGCGCCGA CGGTTCGCTG
GATCTGGATC TCGACAGCCG TATCGAGCTG GAAATGCGGA CCGATCATAA AGAACTGGCC
GAGCACCTGA TGCTGGTCGA TCTGGCGCGT AACGATCTGG CGCGCATCTG TCAGGCCGGT
AGCCGCTATG TGGCCGACCT GACCAAAGTG GACCGCTACT CATTCGTGAT GCACCTGGTG
TCTCGGGTAA TCGGCACCCT GCGCGCCGAC CTCGACGTGC TGCACGCTTA TCAGGCCTGT
ATGAACATGG GCACCCTGAG CGGCGCCCCC AAAGTGCGCG CCATGCAGTT AATCGCCGCC
TCTGAAGGTA CCCGCCGCGG CAGCTACGGC GGTGCGGTCG GTTATTTCAC CGCCACCGGC
GATTTGGATA CCTGTATTGT CATCCGCTCC GCGTATGTTG AAGACGGCAT TGCTACCGTG
CAAGCCGGTG CCGGTGTGGT GTTGGATTCT GTTCCTCAGG CGGAAGCCGA TGAGACCCGT
AATAAGGCAC GTGCCGTGCT GCGTGCCATT GCCAGCGCGC ACCAGGCCAA GGAGGTGTTC
TGA
 
Protein sequence
MMNIKPQLKL LKAEASYRGD PTTIFHQLCG ARPATLLLES AEINSKQNLQ SLLVIDSALR 
ITALGRTVTL HALTANGAAM LPLLDEALPA EVQNQVRPNG RELTFPVIDA IQDEDARLRS
LSVFDALRTL LTLVDSPADE REAVMLGGLF AYDLVAGFED LPPLRQENRC PDFCFYLAET
LLVLDHQRSV ARLQASVFTA DTAEEQRLQQ RLEQLQLQLK QTPQPIPHQK LENMQLSCNQ
TDEEYGAVVS ELQQAIRQGE IFQVVPSRRF SLPCPAPLAA YQTLKDNNPS PYMFYMQDDE
FTLFGASPES ALKYDAGNRQ IEIYPIAGTR PRGRRADGSL DLDLDSRIEL EMRTDHKELA
EHLMLVDLAR NDLARICQAG SRYVADLTKV DRYSFVMHLV SRVIGTLRAD LDVLHAYQAC
MNMGTLSGAP KVRAMQLIAA SEGTRRGSYG GAVGYFTATG DLDTCIVIRS AYVEDGIATV
QAGAGVVLDS VPQAEADETR NKARAVLRAI ASAHQAKEVF