Gene Sros_4373 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4373 
Symbol 
ID8667667 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4880364 
End bp4881638 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content67% 
IMG OID 
Productsodium:dicarboxylate symporter 
Protein accessionYP_003339999 
Protein GI271965803 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAATC TGATGAGCCG ACTGATCGTC GGCGCCCTTG TCCTGGGGGT CGGAGCGGGA 
CTGGTCTTGC ACTACCAGTT CGCCTCAAGC CGCGAGGAGA TCGTCGCGGT CCTGGAGACG
GTGACGCACC TCTTCCTCAA TCTGATCAAG ATGGTGATCG CCCCGCTCAT CTTCGCGACG
ATCGTCGGTG GCATCACCGG CATGGCGAAG GCCACGGGCA TCGGGTCGCT GTTCGCCCGC
TCGATGGTGT GGTTCGTCTC CGCGTCGCTT CTCATCGGCG CCTACGGATT TCTCGCGGCG
CACGCCATGG GCGTGGGGGA CGGGCTGAAC CTGACCCCGG CGGCAGGCGG GGCCGGCATC
GAGACCGAGC CGGTCACGCC GGCCACGTTC GTCGAGGGAC TTGTCCCGCA GAGCTTCATC
GAGGCGTTGG CCTCCAACAA GCCGATCCAG ATCCTTGTGT TCTCAATGTT CTTCGGGGTC
GCCTTGCTCG CCCTCAAGTC CGCCAACGGC GACTCCCGTC TGGCCGACGC GATCGACGAG
CTCACCAACG TAATGCTCAA GGTCACCGGA TACGTGATGG CGCTCGCACC CATCGGCGTC
TTCACCGCCG TCGCCGCGGC ACTCACCGCG GAGGGTGTCG GCGCCTTCGC CACGTACGGG
TCGCTGATCG TCAGTTTCTA CACCGCACTG GCAGGCCTGT GGGCCGCCCT GATCGCCGTG
GGGGCCCTGT TCCTCGGCCG CGGAGTGCTC CGGCTGCTCG CCGCGGTGCG CGAGCCCATG
TTCATCGCGT TCTCGACATC GAGCACGGAG GCCGCGTTCC CCAAGATGAT CAGCTCGCTG
ACGTCCTACG GTGTCGATCG GCGGACGACC GGCCTGATCC TTCCGCTGGG CTACGCGTTC
AACATCGACG GCTCGATGAT GTACATGATG TTCTCGTCGG TGTTCCTGGT CAACGCCTAC
GACATCGACA TGCCCCTCGC CCAGCAGATC CTGATGTGCC TCGTCCTGCT GGTGAGCAGC
AAGGGCATGG CCGGCGTGCC GCGCGGCGCG CTCGTGATCA TCGCCGCGGT CGTTCCCGGC
TTCGGTGTCC CGGCGGCCGG CGTCGCGCTG CTGCTGGTGA TCGACCAACT GCTCGACATG
GGCCGGACCG CGACGAACAT CCTCGGCAAC GCCGTCGCCG TCGCCGTCCT CGGCCGCGGC
ACGACCGGCA CCACGACCCA CGGAACAACA CGAGCCGGCG ACGTTCCCGC GGCGGCCACC
GAACCGGTGC GCTGA
 
Protein sequence
MKNLMSRLIV GALVLGVGAG LVLHYQFASS REEIVAVLET VTHLFLNLIK MVIAPLIFAT 
IVGGITGMAK ATGIGSLFAR SMVWFVSASL LIGAYGFLAA HAMGVGDGLN LTPAAGGAGI
ETEPVTPATF VEGLVPQSFI EALASNKPIQ ILVFSMFFGV ALLALKSANG DSRLADAIDE
LTNVMLKVTG YVMALAPIGV FTAVAAALTA EGVGAFATYG SLIVSFYTAL AGLWAALIAV
GALFLGRGVL RLLAAVREPM FIAFSTSSTE AAFPKMISSL TSYGVDRRTT GLILPLGYAF
NIDGSMMYMM FSSVFLVNAY DIDMPLAQQI LMCLVLLVSS KGMAGVPRGA LVIIAAVVPG
FGVPAAGVAL LLVIDQLLDM GRTATNILGN AVAVAVLGRG TTGTTTHGTT RAGDVPAAAT
EPVR