Gene Sros_1066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1066 
Symbol 
ID8664340 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1089371 
End bp1090576 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content70% 
IMG OID 
Productaminotransferase, class I and II 
Protein accessionYP_003336809 
Protein GI271962613 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.824593 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGTC CTCGCATCTC AGCGCGCATC TCCGCGATCT CCGAGTCCGC CACGCTCGCC 
GTGGACGCCA AGGCCAAGGC GATGAAGGCG GCGGGCCGCC CGGTCATCGG CTTCGGCGCC
GGCGAGCCGG ACTTCGCGAC TCCCGACTAC ATCGTCGAAG CGGCCGTCGA GGCGTGCCGG
AACCCCAGGT TCCACAAGTA CACGCCGGCC GGCGGCCTCC CGGAGCTGAA GCAGGCCATC
GCCGACAAGA CGCTCCGCGA CTCCGGCTAC CAGGTCGACG CCGCCCAGGT CCTGGTCACC
AACGGCGGCA AGCAGGCCGT CTACGAGGCG TTCGCCACGC TGCTGGACCC GGGTGACGAG
GTTCTGGTCA TCGCCCCCTA CTGGACGACC TACCCCGAGG CCATCAAGCT GGCCGGCGGC
GTCCAGGTCG ATGTCGTGAC CGACGAGACC ACCGGCTACC TCGCGAGTGT GGAGCAGCTC
GAAGCCGCCC GCACCGAGCG CACGAAGGTC CTGCTGTTCG TCTCCCCGTC CAACCCGACC
GGCGCGGTCT ACACCCCCGA GCAGGTCGAG GCGATCGGCC GCTGGGCCGC CGGGCACGAC
CTGTGGGTCG TCACCGACGA GATCTACGAG CACCTCACCT ACGGCGACGC GACCTTCGCC
AGCATCGCCA CCGTCGTCCC CGAGCTGGGC GACAAGGTCG TCGTCCTCAA CGGCGTCGCC
AAGACCTACG CGATGACCGG CTGGCGGGTG GGCTGGCTGA TCGGCCCCAA GGACGTCGTC
AAGGCCGCGA CCAACCTGCA GTCGCACGCC ACCTCCAACG TCTCCAACGT CTCCCAGGCC
GCCGCCCTGG CCGCCGTCAC CGGCGACCTG GCGGCCGTGG CGATGATGCG CGAGGCCTTC
GACCGCCGGC GCCAGACCAT GGTCCGCATG CTGAACGAGA TCCCGGGCGT GCTCTGCCCC
GAGCCGAAGG GCGCGTTCTA CGCCTACCCG TCGGTCAAGG AGCTGCTGGG CAAGGACTTC
GGCGGCAAGC GCCCGCAGAC GTCCGCCGAG CTGGCCGAGA TCATCCTTGA GGAGGCCGAG
GTCGCCCTCG TCCCCGGCGA GGCCTTCGGC ACCCCGGGCT ACTTCCGCCT GTCCTACGCA
CTGGGCGACG AGGACCTCGT CGAGGGCGTC AGCCGGGTGG CCAAGTTCCT GGGCGACGCC
CGCTAG
 
Protein sequence
MTRPRISARI SAISESATLA VDAKAKAMKA AGRPVIGFGA GEPDFATPDY IVEAAVEACR 
NPRFHKYTPA GGLPELKQAI ADKTLRDSGY QVDAAQVLVT NGGKQAVYEA FATLLDPGDE
VLVIAPYWTT YPEAIKLAGG VQVDVVTDET TGYLASVEQL EAARTERTKV LLFVSPSNPT
GAVYTPEQVE AIGRWAAGHD LWVVTDEIYE HLTYGDATFA SIATVVPELG DKVVVLNGVA
KTYAMTGWRV GWLIGPKDVV KAATNLQSHA TSNVSNVSQA AALAAVTGDL AAVAMMREAF
DRRRQTMVRM LNEIPGVLCP EPKGAFYAYP SVKELLGKDF GGKRPQTSAE LAEIILEEAE
VALVPGEAFG TPGYFRLSYA LGDEDLVEGV SRVAKFLGDA R