Gene Sros_4459 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4459 
Symbol 
ID8667753 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4970598 
End bp4971623 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content73% 
IMG OID 
Productaminotransferase class V 
Protein accessionYP_003340071 
Protein GI271965875 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000253677 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.00260881 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGATCTCG CCCAGGCGCA AGCACTGTGG GACCCCCACC CCGGCTGGCT CAACACCGCC 
AGCTACGGCC TGCCCCCACG CCCCGCCTTC GAGGACCTGC AACGCGTCCT GACCGACTGG
CGCACAGGCC GCACCGACTG GAAACCCTGG GACACCGCCA CCGACCGCGC CCGCACCGCC
TTCGCCACCC TCACCGGCGT CGCCGCCACC GACGTCGCCG TCGGCGCCAC CGTCTCCCAG
ATGCTCTCCC CCCTCGCGGC CTGCCTGCCA CCCGGCGCAC GCGTCGTGGC CGCCACCGAA
GAGTTCACCT CCAACCTGTT CCCCTGGGCA GCCTCGGCCG ACCTGCACAC CGCCCCCCTG
GACCACCTCG CCGAAGCCAT CGACTCCCGC ACCCGCGTCG TCGCCTTCAG CCTGGTGCAG
TCGGCCGACG GCCGCCAAGC CCCCCTCCAG GACATCCTCA CCGCCGCCCG CGACCACGAC
ACCCTCGTCG TGGCCGACGC CACCCAAGCC TGCGGCTGGC TGCCCGTGCA AGCCGCCCAC
TTCGACATAC TCGTCTGCGC CGCCTACAAA TGGCTCATGG CCCCCCGCGG CGCCACCTAC
GGCTACCTGT CAGCCCGAGC CCGCCGGCAC ATGCGCCCCA TCGCCGCCAA CTGGTACGCC
GGCGCGGACC CCGGCGGCTC CTTCTACGGA CCACCCCTGC GCCTGGCCGA AGGAGCCCGC
GCCTTCGACC TGTCACCGGC CTGGTTCAGC CAGATCGGCG CCGCAGGCTC CATCGACCTG
CTCAACCGCA TCGGCGTGAC CACCGTCCAC GCCCACAACA CCGCACTGGC CGCCCGCTTC
CTCACCGCCC TGGACCAGCC CCCCACCGGC AGCGCGATCG TCACCGTCGA AGCACCTGAC
GCCCGCCGAC GCCTCGAAGC CGCCGGAATC CGCACCGCCG TACGCGCGGG AAAGATCCGC
GCCTCCTTCC ACCTGTACAC CACCGTCGAC GACGTCGACC GCGCCGTGGA GGCCCTGACC
CGATGA
 
Protein sequence
MDLAQAQALW DPHPGWLNTA SYGLPPRPAF EDLQRVLTDW RTGRTDWKPW DTATDRARTA 
FATLTGVAAT DVAVGATVSQ MLSPLAACLP PGARVVAATE EFTSNLFPWA ASADLHTAPL
DHLAEAIDSR TRVVAFSLVQ SADGRQAPLQ DILTAARDHD TLVVADATQA CGWLPVQAAH
FDILVCAAYK WLMAPRGATY GYLSARARRH MRPIAANWYA GADPGGSFYG PPLRLAEGAR
AFDLSPAWFS QIGAAGSIDL LNRIGVTTVH AHNTALAARF LTALDQPPTG SAIVTVEAPD
ARRRLEAAGI RTAVRAGKIR ASFHLYTTVD DVDRAVEALT R