Gene Sros_6607 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_6607 
Symbol 
ID8669916 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp7273856 
End bp7275490 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content75% 
IMG OID 
Productcarbon-nitrogen family hydrolase 
Protein accessionYP_003342062 
Protein GI271967866 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.228741 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.120633 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGCGCG TCGCCGCGGT GCAGTTCGCA ACCGGCCTGG ATGTGACCGC CAACCTCGCG 
ACCTGCCTGC GAATGATCGA CTCCGCCGCC GGGCAGGGGG CCGAGCTGAT CGTGCTGCCC
GAGTTCTGCA ACCACCTGTC CTGGTACGAG AGCCGCGACC ACGCGCGCCG CCTGGCCTGC
CGCCACGGAG ACCCGTTCCT GAGCGCCGTC GCCGGGCGCG CCGCCCGGCA CCGCGCGCAC
GTCAAGCTCG GCGTCACCCT GGCCCGCGAG GACGGCCGCG TCACGGGGGC CTCCCTGCTG
TACGGACCGG ACGGCGCGCT GCTAGGCGAG GCCGACAAGC AGGTGCTGAT GGGCGCGGAG
AACGACCACC TGGACCCGGG AACGGCCGTC GGCCCGGTGG TGGCGACCGC GGTCGGCCGG
CTCGGCATGT ACGCCTGCAT GGAGGGCGTG ATCAGCGAGA TCACCCGCGG GCTGGCGCTG
CGCGGCGCGC AGGTCCTGCT CAACAGCCTG AACTCCTTCG CCGTGGACGA GGCGAGCCTG
CACGTGCCGG TCCGCGCCGC GGAGAACAGG GTGTGGGTGG TCGCCGCCAA CAAGGTCGGG
CCGCTGCTGC CCGCCGACAG GATCGAGCTG ATCGGCGCCG GGCTCGGCGT CCCGCCCGAG
TGGCTGCACG GCGCGGGTGA GAGCCAGGTC GTCGCCCCCG ACGGCACCGT CGTGGCCAGA
GCGCCAAGGA CCGGCGAGGC TGTCGTGGTG GCCGACGTCG ACGTGGCCCT GGCCGACGAC
AAGGTCCGGC CCGACGGCAC GGACGTGCTG GCCGCCCGCC GCCCCGCCCT CTACCGGCCG
ATCGCCGCCG AACCCCGCGG CCGTACGGCT CCCGCCGGAG CGGGCAGCGT GGCCGTGGCC
GTGGTCAGGC CGTGCGCGGG GCTCGGAGGC GCGACGGAGC TCATCAGGCG GGCGGCCGAG
AGCGGGGCGG AGCTGCTCGT GCTGCCCGAG CTGTGCGGGG TGACGGCCGA GGAGGCGGCG
CGGGCCGTAC GCGGGACCAC CGCGCACGTG GTGCTCAGCG AGATCCGCGA CCGGGCGCAC
GACGGGCTGC TGGTCTCGGC CGACGGGATC ATGGGACGGC AGCGCAAGCT CCACCCGTCC
GCGCGGCAGG CCGGGCGGGT CACCGCGTTC GGGGACGGGC TGGAGGTCTT CGAGCTGCCG
TGGGGAAGGC TGGCCATCAT CGTCGGCGAT GACACGATAT TTCCGGAAAC GTTCAGGCTG
GCGGCGCTGG CCGACGCCGA CGTCGTCGCG GCGCCCCTCA CCCCGTCCGA GCCCTGGGAA
CTCCGGTCCG GCCTGCTGGA ACGGGCCGCG GAGAACCGGC TCAACGTCGT CGCCGCCGGA
CACGACGGGC CCGGCGGCCT CGCCGGCGCC ATCCTGGCCG CGCCGCGGGA CTTCACGCTC
TGGACCGCCT GGGAAGGCCC GTTCACCGGC CGCATCAGCC ACCCGATCGT CACCCCGGTC
AGGAACGACG ACCGCGTGGT GCGCGCCGAC GTCCACCCTG CGCAGGCCGT CAACCGGCAC
GTCTCACGCG GCACCGACCT GGTGGACGGC AGACCGTGGC GGCTCGTCGG CGCGCTCCTG
GAAGGAGACA CGTGA
 
Protein sequence
MVRVAAVQFA TGLDVTANLA TCLRMIDSAA GQGAELIVLP EFCNHLSWYE SRDHARRLAC 
RHGDPFLSAV AGRAARHRAH VKLGVTLARE DGRVTGASLL YGPDGALLGE ADKQVLMGAE
NDHLDPGTAV GPVVATAVGR LGMYACMEGV ISEITRGLAL RGAQVLLNSL NSFAVDEASL
HVPVRAAENR VWVVAANKVG PLLPADRIEL IGAGLGVPPE WLHGAGESQV VAPDGTVVAR
APRTGEAVVV ADVDVALADD KVRPDGTDVL AARRPALYRP IAAEPRGRTA PAGAGSVAVA
VVRPCAGLGG ATELIRRAAE SGAELLVLPE LCGVTAEEAA RAVRGTTAHV VLSEIRDRAH
DGLLVSADGI MGRQRKLHPS ARQAGRVTAF GDGLEVFELP WGRLAIIVGD DTIFPETFRL
AALADADVVA APLTPSEPWE LRSGLLERAA ENRLNVVAAG HDGPGGLAGA ILAAPRDFTL
WTAWEGPFTG RISHPIVTPV RNDDRVVRAD VHPAQAVNRH VSRGTDLVDG RPWRLVGALL
EGDT