Gene Sros_7448 details


Gene Information

Locus tag: Sros_7448
Symbol:
ID: 8670769
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 8231990
End bp: 8233870
Gene Length: 1881 bp
Protein Length: 626 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: ABC-type dipeptide transport system periplasmic component-like protein
Protein accession: YP_003342874
Protein GI: 271968678
COG category:
COG ID:
TIGRFAM ID:
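
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the record does not state either point explicitly, and the function names are illustrative only.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp):
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(8231990, 8233870)
print(length)                     # 1881
print(protein_length_aa(length))  # 626
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.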


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.338891
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGACAG GCAACTCGCG GCGCAGGCTG GCCCTTGCCG CTTGCGCGCT GAGCGGTTCA 
CTGGTGGCCG CTGCCACCAC CTTCGGCGCC GCAGCCGCCG CAGCCGCCGC AGCCGCCGCG
CCGGACGCGT CCGTGTCCGC GTCGCAGGAG ACCACGCTGC GGGTGAAGAT GTCGGGCGCG
GGCGTCGACA CGCTCAACCC GTTCCTCGCC TTCTTCAACG GCGCCCTGGA CATCTTCGGC
TCGATCTACC CCACGCTGAA CTCGCTGGAC GAGAACGGCA AGCCGGGCCC GTACCTGGCC
GAGTCGTGGA CGCCGTCGGA GGACAAGCTC ACCTGGACCT TCAAGCTCAA GGACGGTCTG
AAGTGGAGCG ACGGCAAGCC GATCACGGCT GAGGACGCCG CCTGGACCCT CAACCTGATC
ATGACCGACA CGGTCGCGGG CACCGCCAAC GGCTCGCTGG TCGGCAACTT CGAGTCCGTC
ACCGCGACCG ACCCTACGAC GCTGGTCATC AAGACCAAGG CGCCGCAGGC CAACGTCGTC
TACGTGAGCA TCCCGATCAG CGGCATCCCG ATCGTGCCGA AGCACATCTG GGAGCCGCGG
GCCAAGAACC TCAAGGACGC CAAGAACGAC ACCTTCCCCG TCGTCGGCTA CGGCCCGTGG
ACGCTCACGG ACTACAAGCC CGAGCAGTAC GCGAAGTTCG ACGCCAACAA GGACTTCATC
CTCGGCAGGC CCGGCTTCGA CCACATGATC CAGCAGAGCT TCAAGTCGAC CGACGCCGCC
GTCGCCGCGC TGCGCAGCGG CCAGCTGGAC TACATCAACG CCGTGAACCC GACCCAGTTC
AAGGCGCTGC AGGCGGACAA GAGCCTGCTG ACCGCCCAGG AGGTCGGCAA CGGCTGGACC
GGTGTCGAGG TCAACCACAA CGCCCGCACC CGCACCGGCA AGAAGATCGG CACCGGCCAT
CCCGCGCTCG GCGACCCGGT GCTGCGCCGC GCGATCTCGC TGGCCACCGA CAGGAAGACG
CTGGTCACCA AGGTGCTCGA CGGCATGGGC GTGGCCGGCT CCGGCTACCT GCCCCCGGCC
TGGCCGCAGT GGAGCTGGAA GCCGGCCGCG GGCCAGGAGA CGCCGTTCGA CCTCGCCCAG
GCCGGCAAGA TCCTCGACGA CGCCGGCTAC ACCAAGGGCG CCGACGGCGT CCGCGTCGAC
CCCAAGAGCG GCAGGCCGCT GGAGCTGCGC CTCGGCATCC ACTCCGACGA CACGGCCGAC
GCGGGCATCT CCACCTACCT CAAGGGCTGG CTGGAGACGA TCGGGATCAA GCTCAAGATC
CAGACGCTGA GCATGAGCGC GCTCAACAGC GACCTGGCCA AGGGCGACTG GGACCTGCTG
ATGGACGGCT GGACCACCGG CCCCGACCCG ACCTACCTGC TCGGCATCCA GACCTGCGCC
ACGCTGCCCA AGGACGACGG CACCGGCGGC AACACCGACG CCTTCTTCTG CGACGAGGCC
TACGACGAGC TGTTCAAGAA GCAGCTCACC ACGTTCGACC AGGACGAGCG GGCCAAGGTC
GTCGCCGAGA TGCAGGACAT CCTGTACAAG GCCGACGTCG ACCAGATCCA CTTCTACGCC
AACACCCTCG ACGTCGCCCG CACCGACACC GTGACCGGCC TGATCACCGG GCAGCCGGAC
GCGCAGGGCA TGTACCCGGC GCAGACCGCG TTCTGGAGCT ACCTCAAGGC CGCGCCGCCC
GCCGCCAAGC CGGCCGCCGC CTCGGCCGGG GAGGAGAGCG GCGGCGGCCA GCTGTGGGTC
GGCGCGGGCG TCCTGCTCCT CGCGCTCGTC GGCGGCGGGA TCGTGCTCAG GCGCCGCTCG
GGCGCGGGCG ACCGTGAGTA G
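
The GC content reported for this gene is the percentage of G and C bases in the sequence. A minimal sketch of that calculation is shown below, applied only to the first row of the gene sequence as a small worked input; `gc_percent` is an illustrative name, and how the source database rounds its figure is not stated.

```python
def gc_percent(seq):
    """Percentage of G and C among the A/C/G/T characters of seq.

    Spaces and line breaks (as used in the 10-base grouping) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence only, not the whole gene.
first_row = "ATGGGGACAG GCAACTCGCG GCGCAGGCTG GCCCTTGCCG CTTGCGCGCT GAGCGGTTCA"
print(gc_percent(first_row))  # 70.0
```

Passing the full 1881 bp sequence to the same function would give the gene-wide value to compare with the GC content field.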
 
Protein sequence
MGTGNSRRRL ALAACALSGS LVAAATTFGA AAAAAAAAAA PDASVSASQE TTLRVKMSGA 
GVDTLNPFLA FFNGALDIFG SIYPTLNSLD ENGKPGPYLA ESWTPSEDKL TWTFKLKDGL
KWSDGKPITA EDAAWTLNLI MTDTVAGTAN GSLVGNFESV TATDPTTLVI KTKAPQANVV
YVSIPISGIP IVPKHIWEPR AKNLKDAKND TFPVVGYGPW TLTDYKPEQY AKFDANKDFI
LGRPGFDHMI QQSFKSTDAA VAALRSGQLD YINAVNPTQF KALQADKSLL TAQEVGNGWT
GVEVNHNART RTGKKIGTGH PALGDPVLRR AISLATDRKT LVTKVLDGMG VAGSGYLPPA
WPQWSWKPAA GQETPFDLAQ AGKILDDAGY TKGADGVRVD PKSGRPLELR LGIHSDDTAD
AGISTYLKGW LETIGIKLKI QTLSMSALNS DLAKGDWDLL MDGWTTGPDP TYLLGIQTCA
TLPKDDGTGG NTDAFFCDEA YDELFKKQLT TFDQDERAKV VAEMQDILYK ADVDQIHFYA
NTLDVARTDT VTGLITGQPD AQGMYPAQTA FWSYLKAAPP AAKPAAASAG EESGGGQLWV
GAGVLLLALV GGGIVLRRRS GAGDRE
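
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a hedged illustration rather than the database's own method: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and the final row of the gene sequence.
print(translate("ATGGGGACAG GCAACTCGCG GCGCAGGCTG"))  # MGTGNSRRRL
print(translate("GGCGCGGGCG ACCGTGAGTA G"))           # GAGDRE
```

Both fragments reproduce the corresponding start and end of the protein sequence listed above, with the terminal TAG read as a stop codon.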