Gene Sros_5701 details


Gene Information

Locus tag: Sros_5701
Symbol:
ID: 8668995
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 6234438
End bp: 6236261
Gene Length: 1824 bp
Protein Length: 607 aa
Translation table: 11
GC content: 67%
IMG OID:
Product: ABC-type dipeptide transport system periplasmic component-like protein
Protein accession: YP_003341192
Protein GI: 271966996
COG category:
COG ID:
TIGRFAM ID:
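
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of Python. This is only a minimal sketch: it assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(6234438, 6236261)
print(length_bp, protein_length(length_bp))  # prints: 1824 607
```

Both values agree with the Gene Length and Protein Length fields of this record.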


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.167771
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCGAC TAGCAGCTGT AGCGGCGTTG CTCGGCCTCG CCGCCGTCAC GGCGTGCAGC 
GGCAGTACCG GAGACAAGCC CGCCGCCAGC GGCGGCAGCG GCGGCAGCGG CGGCGGTCTC
TACACCACGA TCGACGGACT CAAGCCCGGC CTCGACGTCA ACGGGCCGAT CAACCCGTGG
AACCCCAAGG GCAACGCGTT CGTCGGGCTC AACGCCATGC GGATCGCCTG GTCGAAGAAC
CACATGACCG ACCCGAACCA GTTCTACCCG GGCATCGCGG AGAGCTGGGA GATCGCGCCG
GACAACTCCT CGATCACGCT GCACCTGCAC CCGGACAACA AGTGGTCCGA CGGCAAGCCG
GTCACCGCCG AGGACGTGAA GTTCTCCATC GCCCTGGCCT ACACGCAGGG CAGCACCGCC
TTCGCGATCG ACCCGGGCGC GGCGGGCGCA GCCTCCGAGG TCGAGGTGGT CGACGACAAG
ACCGTCAAGA TCACCCAGGA CATGGACAAC CCCAGCGTCA CGTTCGTGCG CGGCGTCATG
GACAGCTACA TCGTGCCCCA GCACGTCTGG AGCAGCGTGC TGACCGCCGA TTTCTGGGAC
AAGCTGAAGG CCGCCCGCGG CGAGGGTGCC GAGGCCGAGA AGGCCCGCGA GGAGATCACC
GCGCTGTCGG AGAAGGTCCT CGCCTTCGCC CCGCCCAAGG ACGTCTCCGC CGGCCCGTTC
ACGCTGGAGC GGATGAACCC GAGCGAGGCG CTGCTGGTCA AGAACAAGAA CTTCTACAAC
GCCGCGAACG TCGGGCCCGA CCAGGTCAAG CTGCTCAACT ACACCGGCAA CGAGCAGATC
TGGAACTACC TCATCGCCGG CAAGCTCGAC AACGCGCCGT TCACCGCCGT GCCCGCCGAC
GTGATGAAGC GCATCAGCAG CACCCCGGGC AACGGGGTGA TCAAGGGCTA CTCGCCGGTG
TCGCTGGGCA TGGCCTTCAA CCAGGCCAAG AAGCCCTACG ACAACGTGCA CGTGCGGCGC
GGCCTGGCCT ACCTGATCAA CCGGGACGAG ATCACCAAGA TCGCCTCGCC GGAGGGCGGC
ACCCCGGCGC TCACCACCAC CGGTATCCAC CAGAAGCCCG CCGCGGAGTG GCTCGGCGCC
GACCTCGCCA CGCTGGAGCC GTACAAGCTC GACGCGGCCA AGGCCGAGGA GGAGTTCAAG
AAGGCGGGCC TGAAGAAGGA CGGCGGCAAG TGGACGCTGC CCGACGGCAC GCCGTGGAAG
TTCACCGTCA ACGTCCCGGC GCCGTTCTCC GACTGGATCT CCGGCGCCAA GGCGATCACC
AGCCAGCTCA CCGAGGCGGG GATCGACGCC GAGGTCGTGA CCACCGCCGA CTACCCGCTG
TACCTCAAGG AGATCGCCGA GGGCAAGTAT GACGTCGGGT TCTGGCTGAT CGCGCTCGGC
CCCGCGCCGT ACAACATCTA CCAGCGCCTC TACGGTGCCT CCAACGGGTG GTCCATCCTC
GGCGGCAAGA TCAAGCACGC CGAGCCCGGC AAGAACGGCA ACTGGATGGG CGGCCCGGAG
ACCATCGAGG TCGACGGGGC CAAGGTCAAC CCCGGTGAGC TCACCGCCAA GCTGAACTCC
GCCTCCGGCG ACGAGCAGAA GAAGATCATC GGCCAGCTCG CCAAGGCGGC CAACCAGGAC
CTGCCGGTGG TCCAGCTCTG GGACTACGTC AACACCCAGT TCGTCAACAC CAACCGCTTC
TCCGGCTTCC CCGAGAACGA CAGCGACCTG CTCCGCCAGC CCTCCGGCGT GTGGATCCAG
CTCGGCATGG TCAAGAAGCA GTAA
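
The gene sequence is printed in space-separated blocks of 10 bases, so it needs light cleaning before any computation. The sketch below, with hypothetical helper names, strips the whitespace and computes the GC fraction. It is shown on the first line of the sequence only; running it on the full sequence would be the way to compare against the GC content field above.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from this record.
first_line = "ATGAAGCGAC TAGCAGCTGT AGCGGCGTTG CTCGGCCTCG CCGCCGTCAC GGCGTGCAGC"
seq = clean_sequence(first_line)
print(len(seq), round(100 * gc_fraction(seq), 1))
```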
 
Protein sequence
MKRLAAVAAL LGLAAVTACS GSTGDKPAAS GGSGGSGGGL YTTIDGLKPG LDVNGPINPW 
NPKGNAFVGL NAMRIAWSKN HMTDPNQFYP GIAESWEIAP DNSSITLHLH PDNKWSDGKP
VTAEDVKFSI ALAYTQGSTA FAIDPGAAGA ASEVEVVDDK TVKITQDMDN PSVTFVRGVM
DSYIVPQHVW SSVLTADFWD KLKAARGEGA EAEKAREEIT ALSEKVLAFA PPKDVSAGPF
TLERMNPSEA LLVKNKNFYN AANVGPDQVK LLNYTGNEQI WNYLIAGKLD NAPFTAVPAD
VMKRISSTPG NGVIKGYSPV SLGMAFNQAK KPYDNVHVRR GLAYLINRDE ITKIASPEGG
TPALTTTGIH QKPAAEWLGA DLATLEPYKL DAAKAEEEFK KAGLKKDGGK WTLPDGTPWK
FTVNVPAPFS DWISGAKAIT SQLTEAGIDA EVVTTADYPL YLKEIAEGKY DVGFWLIALG
PAPYNIYQRL YGASNGWSIL GGKIKHAEPG KNGNWMGGPE TIEVDGAKVN PGELTAKLNS
ASGDEQKKII GQLAKAANQD LPVVQLWDYV NTQFVNTNRF SGFPENDSDL LRQPSGVWIQ
LGMVKKQ
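
The protein sequence can be reproduced from the gene sequence by codon translation. The following is a minimal sketch rather than a full implementation of translation table 11: it uses the amino acid assignments that table 11 shares with the standard code and does not handle alternative start codons, which is sufficient here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Compact codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence in this record.
print(translate("ATGAAGCGAC TAGCAGCTGT AGCGGCGTTG"))  # prints: MKRLAAVAAL
print(translate("CTCGGCATGG TCAAGAAGCA GTAA"))        # prints: LGMVKKQ
```

The two outputs match the first and last blocks of the protein sequence listed above.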