Gene Spro_3423 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_3423 
Symbol 
ID5604339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp3788129 
End bp3789397 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content61% 
IMG OID640938976 
Productenterobactin exporter EntS 
Protein accessionYP_001479649 
Protein GI157371660 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.408147 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.319337 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAAGC CCTCTATTTT GCTTGATTTT GGCCTGCTTA AGACCAATGG CGCTTTTCGT 
GCCGTTTTTT GCGCGCGCTT TATTTCCATT TTAGCGCTGG GGTTGATGGC CATTGCGATC
CCGGTGCAGA TCCAGGCCTT GACCGGGTCG ACGTTGCTGG TCGGGCTATC CGTAACGCTG
GCGGGTAGCG GCATGTTTGC CGGTTTGCTG ATGGGGGGCG TGCTGGCCGA CCGCTATGAA
CGCCGTCGCC TGATATTGTT TGCCCGCTCC ACCTGCGGCA TTGGTTTTGT CGGCCTGTGC
CTGAACGCCG CACTGCCCAC GCCTTCATTA ACGCTGATTT ACCTGCTGGC AGTCTGGGAC
GGCTTTTTCG GTGCGGTTGG CGTCACGGCG CTGCTGGCGG CGACACCGGC GTTGGTGGGG
CGTGAAAATA TTGTGCAGGC CGGGGCCATC AGCATGCTGA CGGTGCGTTT TGGTTCGATC
CTCTCTCCGG CGATCGGCGG GTTGGTGATT GCCAACGCCG GTGTGGCATG GAACTACGGT
CTGGCCGCAT TTGGCACCTT GCTGACGCTG ATCCCGTTAC TGCGTCTGCC ACAGTTGCTG
CCGCCCCCGC AGCCGCGTGA GCACCCGTTA CGTGCGCTGG CCGGTGGCTT TGGTTTCCTG
TTTCGCAACC GGGTGATTGG CATGGTGGCG CTGATCGGCG CGCTGTTGAC CATGGCCAGT
GCGGTGCGGG TGCTGTATCC GGCCATGTCC GGTATGTGGC AGGTCAGTGC GGCTCAACTG
GGCTTTATGT ATTCCGCGGT GCCACTGGGC GCAGCCATTG GTGCCTTTAC CAGCGGCCGG
GTGGCTCATG TCGCGCGTCC GGGCTGGATG ATGCTGATGA CGGCGATTGG CGCTTTCGTC
GCTATTGGCC TGTTCGGTCT GATGCCGTGG TACGGCCTGG CGCTGTTCTT CCTGGTGGTT
TTCGGCTACC TGAGCGCGTT GAACTCATTG CTGCAGTACG GGTTGATCCA GAACCTGACG
CCGGATGCGT TCCTCGGCCG TATCAATGGT CTGTGGACGG CGCAAAACGT GGTGGGTGAT
GCACTGGGCG CGCTGCTGCT GGGGGCCATG GGGGCGTTTA TGCTGCCGGC GATGACCTCC
ACCAGCTTTG GTTTTGGCGT GGCGCTACTC GGTGTGGTGC TGGCATTTGC GATGCGCGGT
TTGCGTCAGG TGGGCAGTGG CGGGCAAGAA AGCGATCTCC AGCCCGCCGC GGGATCTACT
GAGAAGTAA
 
Protein sequence
MSKPSILLDF GLLKTNGAFR AVFCARFISI LALGLMAIAI PVQIQALTGS TLLVGLSVTL 
AGSGMFAGLL MGGVLADRYE RRRLILFARS TCGIGFVGLC LNAALPTPSL TLIYLLAVWD
GFFGAVGVTA LLAATPALVG RENIVQAGAI SMLTVRFGSI LSPAIGGLVI ANAGVAWNYG
LAAFGTLLTL IPLLRLPQLL PPPQPREHPL RALAGGFGFL FRNRVIGMVA LIGALLTMAS
AVRVLYPAMS GMWQVSAAQL GFMYSAVPLG AAIGAFTSGR VAHVARPGWM MLMTAIGAFV
AIGLFGLMPW YGLALFFLVV FGYLSALNSL LQYGLIQNLT PDAFLGRING LWTAQNVVGD
ALGALLLGAM GAFMLPAMTS TSFGFGVALL GVVLAFAMRG LRQVGSGGQE SDLQPAAGST
EK