Gene Sros_3367 details


Gene Information

Locus tag: Sros_3367
Symbol:
ID: 8666655
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 3696694
End bp: 3698013
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: putative cytosine permease
Protein accession: YP_003339049
Protein GI: 271964853
COG category:
COG ID:
TIGRFAM ID:
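The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


length = gene_length(3696694, 3698013)
print(length)                  # 1320
print(protein_length(length))  # 439
```

Both values agree with the Gene Length and Protein Length fields listed in the record.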


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.130336
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0781791
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGATG AGGACTACCC GCTGGAGCGG GTTCCGCAGG CTGCCCGCTA CTCGTGGTTC 
AACGTGGCCG TGCAGCGCTT CGGCCAGCTG TCGGACCTCA CGCAGTTCCT GCTCGGCGCG
ACGCTGGGCG CGGGGCTGTC GTTCTGGGGA GCGTTCTGGG CGTTCACGCT CGGCTCGGTG
ATCCTGGAGA TCGTCTGCAT CTTCGTCGGC ATCGCCGGGA TGCGCGAGGG CCTGTCCACC
TCCGTGCTGG CCCGCTGGAC CGGGTTCGGC CGGTACGGCT CCACGCTGAT CGGCGTGATC
ATCACGCTGA GCCTGTTCGG CTGGTTCGGC GTGCAGACCG CGGTCTTCGC GGCGGGCCTG
CACGCCATCA TGAGCGGCAT CCCCCTGTGG GCCTGGTCCC TCATCGCCGG CCTCGGCGTG
ACCGCCCTGG TCCTGAAGGG CTTCCGGGCG ATGGGCTGGA CGGCCTTCGT CACCGTGCCC
GCGTTCCTGG GGCTGGCGGG CTGGGCCATG TGGGTGGAGG TCTCCCGGCA CAGCCTGGGC
GAGCTGATCT CCTCCTCCCC CTTCGGCGCC CCGATCACGG TCGCGACCGG GGCGACGATC
GTCGCCGGTT CCTACATCGT CGGCGCGGTC ACCACCCCGG ACATGACCCG GTTCAACCGC
AGCACCTCCG ACGTGGTCAA GCAGACGCTG GTCGGCATCT CCCTCGGCGA GTACGTGCTC
GGCCTGGCCG GGGTGCTCCT GGCCTACGCC GTCAAGACCT CCGACATCGT CGCGATCATC
ACGGCCTCCT CCGGAGTCGT CGGCGTCGTC ATCCTGGTCT CGGCCACCGT GAAGATCAAC
AACTGGAACC TCTACTCGGC GGCGCTCGGA CTGATGAACG CCGTGGAGTC CACGGTCGGC
GTACGGCTCA ACCGGGTCGC CGTCACCGTC GGCATCGGCC TGCTGGGCAG CATCGCCGCC
GCCGCCGGGA TCCTGGACGC CTTCGCCGGA TTCCTGTTCG TCCTCGGCGT CGTCACCCCG
CCGATCGCCG GGATCATGGT CGCCGAGTAC TTCGTGGTCA AGCGCTGGCG GCCCGTCCTG
GACGCCTCCC GCGAGCTCGG GCGGCTGCCC GAGACCGAGC CCGCCTGGGT GCCCGCCACC
ATCGCGATCT GGGCGGCCGC GGCCCTGGTC GGCTGGCTGA GCGACGCCTA CGGGTGGATC
GGCATCCCCG CGCTCAACTC GCTGATCCTG GCCGGGCTCG GCTACATCGT CGCCGGGAAG
CTCGGCCTGG TCCGCGGGAC GCGGGAACTC CCCGTCGACC AGCCTGCCGT ACACGTCTGA
 
Protein sequence
MIDEDYPLER VPQAARYSWF NVAVQRFGQL SDLTQFLLGA TLGAGLSFWG AFWAFTLGSV 
ILEIVCIFVG IAGMREGLST SVLARWTGFG RYGSTLIGVI ITLSLFGWFG VQTAVFAAGL
HAIMSGIPLW AWSLIAGLGV TALVLKGFRA MGWTAFVTVP AFLGLAGWAM WVEVSRHSLG
ELISSSPFGA PITVATGATI VAGSYIVGAV TTPDMTRFNR STSDVVKQTL VGISLGEYVL
GLAGVLLAYA VKTSDIVAII TASSGVVGVV ILVSATVKIN NWNLYSAALG LMNAVESTVG
VRLNRVAVTV GIGLLGSIAA AAGILDAFAG FLFVLGVVTP PIAGIMVAEY FVVKRWRPVL
DASRELGRLP ETEPAWVPAT IAIWAAAALV GWLSDAYGWI GIPALNSLIL AGLGYIVAGK
LGLVRGTREL PVDQPAVHV
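
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is an illustrative sketch, not the tool that produced the record: it builds the standard codon-to-amino-acid mapping, which translation table 11 shares with the standard code for internal codons, and the `translate` helper is a hypothetical name. Table 11 additionally permits alternative start codons, which this sketch does not handle; that does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codon table in NCBI base order (T, C, A, G). Table 11 uses the same
# amino-acid assignments as the standard code; only start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGATCGATG AGGACTACCC GCTGGAGCGG"))  # MIDEDYPLER
```

Passing the full gene sequence to `translate` in the same way would allow a comparison against the full protein sequence listed in the record.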