Gene Sros_1267 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1267 
Symbol 
ID8664542 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1301400 
End bp1302815 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content70% 
IMG OID 
Productprotein of unknown function DUF201 
Protein accessionYP_003337008 
Protein GI271962812 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCACTA ATATCCCGTT CGGGGAGAAG CATTCCAGAT TAAGCATCTT CGAACCGAAC 
GAGGGGACCC TGGTGGTCAT CCCGTCACTC TCGCTTCCCC AGGACGAGCT GCGCAGGATC
ACCGGCGCCC TGTGCTACGA GGAGCGGCTG CTGTTCCTCC TGCTCACCCT CAGGCAGCCG
GACGTCGAAG TCGTCTACCT GTCCTCCGTT CCGGTGGACA CCGCGATCGT GGACTACTAC
CTGGGCTTCC TCGACGACCC CGACGAGGCG CGCACCCGCC TGCAAATGAT CAGTTTGGAC
GAGCCGCGCA CGGGGCCGCT CACCATGTCG CTGCTGCACC GCCCGGATGT CATCGCGCGG
ATCCGCGCGG CGCTCGGCCG TACGGCGGGC GCCTGGATGG TGCCGTTCGT GGTCAGTGAG
GCCGAGGAGC GGCTCGCCGA GATCCTCGGC CTGCCGATCT ACGGCCCGGC CACCTCGCTG
GCCCACCTCG GGTCCAAGAG CGGCGGACGC ATGATCGCCG AGGAGGCGGG GGTGCCGATG
GCCCGGGGCT TCGCCGACCT GCGGTCGCTG ACCGAGGTGG AGCACGCGGC CCGCGCGCTG
AGCCCCAGGT CCAAGCTGAT GGTCAAGCTC AACAACAGCT ACTCCGGGCT GGGCAACGCC
GTCGTGATCA AGGACGAGCG GCCGCTCACC GCCTGCCACA CGAGCTTCTC GGCGGCGGAC
GAGAACTGGA CGACGTTCGC CGAGAAGATC GCCGAGCGGG GCGCGGTGAT CGAGGAGTTC
ATCGAGGACC GGCCGCTGCA CTCCCCCAGC GCCCTGGCCA GGATCACCCC CGGCGGCGCC
TATGACGTGG TCGCCACCCA CGAGCAGCTT CTCGGCGGCC CGAACGGCGA CCTCTACCAG
GGCTGCGCCT TCCCCGCCCG GCCGGAGTAC CGGGCCCAGG TGGGCGAGTG CGCCGAGCGG
ATCGCCCGGG TCCTCGCGGG CCGGGGCGTG GTGGGCCTGT TCGGCATGGA CTTCTTCGCC
GTCAAGACCG ACGCCGGCTA CCGGGCCCTG CTGTGCGAGA TCAACCTGCG GATCGGGGGC
ACCACGCACC CGTTCGGCGC CGCCCTGCTC ACCACCGGCG CCTCCTACGA TCCCGGCACC
GGCACGCTCG TGCACGGCGG CCGGTCGAAG TACTACGTGG CGACCGACAA CTGCACCGCC
GCCTGCCTGC GGGGCCGTAC GCCCGCGGAG GTCGTCAAGC TGATCGACGA CAGGGGTCTC
GGCTTCGACC GCGAGGCCCG CACGGGCAAC GTGCTGCACC TGCTCGGCGC GGTCCCGGAG
TACGGCAAGC TCGGTTTCAC CAGCATCGGC GACTCGGCCG AGGAGGCCGC CGAGCTACAC
CGGAGGACCC TGCGGGCGCT TAACCAGTCC GCGTAG
 
Protein sequence
MITNIPFGEK HSRLSIFEPN EGTLVVIPSL SLPQDELRRI TGALCYEERL LFLLLTLRQP 
DVEVVYLSSV PVDTAIVDYY LGFLDDPDEA RTRLQMISLD EPRTGPLTMS LLHRPDVIAR
IRAALGRTAG AWMVPFVVSE AEERLAEILG LPIYGPATSL AHLGSKSGGR MIAEEAGVPM
ARGFADLRSL TEVEHAARAL SPRSKLMVKL NNSYSGLGNA VVIKDERPLT ACHTSFSAAD
ENWTTFAEKI AERGAVIEEF IEDRPLHSPS ALARITPGGA YDVVATHEQL LGGPNGDLYQ
GCAFPARPEY RAQVGECAER IARVLAGRGV VGLFGMDFFA VKTDAGYRAL LCEINLRIGG
TTHPFGAALL TTGASYDPGT GTLVHGGRSK YYVATDNCTA ACLRGRTPAE VVKLIDDRGL
GFDREARTGN VLHLLGAVPE YGKLGFTSIG DSAEEAAELH RRTLRALNQS A