Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_4297 |
Symbol | |
ID | 8667591 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 4799471 |
End bp | 4802545 |
Gene Length | 3075 bp |
Protein Length | 1024 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | Glycine--tRNA ligase |
Protein accession | YP_003339929 |
Protein GI | 271965733 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.83812 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGCTTGA AGAAGACTGA AGCCGTAAGC CCGATCCCCG CCATCATCGA GGTCGCCGCC GTGTTGACGA TGCAGGATGC TCTCCGTGCC CTCACTGACT ACTGGTCGCG TCACGGATGC ACGCTGATAC AGCCGTACAA CTCCGAAGTC GGCGCGGGCA CGCTCAATCC GGCGACCGTA CTGCGGGTGC TCGGTCCGGA GCCGTGGCGG GTGGCATATG TGGAGCCCAG CGTACGGCCC GATGATGCCC GGTACGGAGA AAATCCGAAC CGCCTGCAGA CGCATACCCA GTTTCAGGTG ATCCTCAAGC CGGACCCGGG TGATCCGCAG GAGCTTTATC TGGGGAGCCT GGCGGCCCTG GGCATCGACC TGCATGCGCA CGATGTCAGA TTCGTGGAGG ACAACTGGGC CTCGCCAGCG TTGGGCGCAT GGGGTCTCGG GTGGGAGGTG TGGCTCGATG GCCTCGAGAT CACCCAGTTC ACCTATTTTC AACAGGCCGG CGGGATAACG CTCGATCCCG TGCCGGTAGA GATCACCTAT GGTATCGAGC GGATCATGAT GGCCCTGCAG AGAGTCGATC ATTTCAAGGA GATCCGTTAC GCCACAGGTG TTTCGTACGG TGAGATATTC GGCCAGGCCG AGTACGAAAT GAGCCGCTAC TACCTCGACG AAGCGAGCAC GGACGCCAAC GGGCGCCTCT TCGAGGAATA TGCCGCAGAG GCCCGGCGGA TGCTGGACGC CCGGCTGCCG GTGCCCGCGC ACAATTACGT GTTGAAGTGC TCGCACACCT TCAATGTGCT CGACTCCCGA GGGGTGGTCT CCACCACGGA GCGGGCAGCG GCCTTCAAGC GGATGCGAGG GCTGGCACGG GACGTGGCAC ACCTGTGGGT GGAGCGGCGG GCCGAACTCG GCCACCCGCT CGGAGTGGTC GGTCCTCCGG CGCCGGCCGA GCTGCCCGTA TCGACCGCGG TGCCCGGCAC TCCCGCCACC TTGGTCTTCG AGATCGGCAC CGAGGAGCTG CCGGCCGGCG ACGTGACCCG GACGGCCGCG ACGGTGCGTG ACGCGGTAGC TCGGAAGCTC GCCGCGACGC GGCTCGGCCA CGGTGGGATA CGCACGTACG CCACCCCCCG CCGCGTGGTC ACAATTGTGG AACAGGTGCA GCCGCGCGAG CCGGACAGCG AGCAGACCGT ACGCGGTCCA CGCGTGAGTG TCGCCTACGC CGACGACGGC GAGCCGACCA AGGCCGCGGT CGGCTTCGCC CGCGGCCGGC AGGTGGATGT CGCCAAGCTG CGACGGATCG ATGTGGACGG CGTGGCACAT GTCGGGGTGG TCAGGCCAGA CCCGGGCCGA GACGCCGTGG CAGTTCTCAG CGAGCTCCTG GGACAGGTCG TCGCGGAATT GCGCGCCGAC AAGAACATGC GATGGAACGA CCCGAAGTTG TCGTTCGTCC GGCCGTTGCG GTGGTTGCTC GCACTTCTCG GCGACGTCCC CGTGCCTCTG GCGGTCTCGG CGCTGGCCGG CGGGACCACC ACCCGGGTGC TCCGGAACGC GGCCGAGCCC TCGGTCGAAG TGCCGAGCGC GGACGGCTAC CTGGAGTTGC TGGCAGGAGA AGGGATCGTA CTGGATCCCG CCGAGCGGCG GACCCGGATC GTCAGGGCGG CGCAGCGGCT GGCCAGGGAC GTGGGCGGCG TCGTCGACGT TGAAGGCGAG TCGGCTCTCA TCGACGAGAT CACCAACTTG GTCGAGGAAC CGACGCCGAT CCGGGGCGGA TTCGCCGAGG AGTACCTCGC GCTGCCATCC GAGATCCTCA TGACGGTCAT GCGCAAGCAC CAGAGGTACT TGCCCGTGCG CGCCGCGGAC GGCACGCTGA TGCCGTACTT CATCACGGTC GCGAATGGCT CCTGCGATCA GGAGGTGGTC GGAACCGGGA ACGAAGCGGT GCTGCGCGCT CGATACGAGG ACGCGCTCTT CTTCTGGCGC GCGGATCTTG ACGTGTCCCC GGACAGGTTC CGTGCGGCTC TGGACAAGCT GTCCTTCGCG GAGAATCTCG GCTCCATGGC CGATCGTGCC GATCGGATCG CCGCTGTCGC GCAGGACCTG GGTGCGTTGC CCATGGTCTT CATGACCGAG GCCGAAAGGC GGACCCTGAC CCGCGCCGCC GCACTGGCGA AGTTCGATCT CGCGTCACAC ATGGTGGTGG AGCTCACCAG CCTCGCCGGC ACCATGGCTC GAGAGTATGC CCTCCGGGCC GGCGAACCAG AGCCGGTTGC GCAGGCGCTC CATGAGATGG AACAGCCCCG GTCCGCCGGC GGACAGCTGC CACGCAGCAC CCCGGGCGCG CTGCTCGCAC TCGCGGATCG GCTCGACCTG CTCGTGGGGA TGTTCGGCAT CGGGTCCGGC CCCACCGGCC GGTCCGACCC CTTCGGTCTG CGCCGAGCGG CATCGGGCCT GGTGAGCATC CTCCGCGACC ACACGGCTCT ACGGTCGATG GACGTGAGCG CCTGCCTGGC CGTCGCCGCG GAACACCTCC GGGCGCGAGC CGTCGACGTT CCGGCGACGA CGCTGGACGA GATCCGGGAA TTCGTCACCC GCCGGTACGA GCAGCAACTT CTGGACTCCG GACAGGACCA CCGGTACGTC GCCGCTGTGC TCCCCCTGGC CGGCAGTCCG GCCCGAGCAG ACGAAACGCT TGCCGAGCTG CACAAGCGCG CGAAGGCACC CGACTTCGCC GAGCTTGTTG CCGCTCTCCT GCGAGTGAGA CGGATCGTGC CGGCGGGCAC CGCGGCGGGA TACGACGCGA GCCGGCTGGT CGAGCCCTGT GAGCTCTCCC TCCACCAGGT CCTGGGCGAG GTACGGACCG CCCTGGGCGA GCAGCCCACG CGGTTGCGGG ACTTCGTCGA TACCGCTGCC GCGCTCGTCG GACCGGTCAA CACCTTCTTC GATGAGGTCC TGGTCATGGA TGAGGACGCC GACCGGCGTG CGACCCGGCT CGGTCTCCTC GCCCACATCC GCGATCTGTC GGCAGGTGTC CTCGACTGGT CCGCCCTGGG CGCCATGCCG GCCGCCGGCA AGTAG
|
Protein sequence | MCLKKTEAVS PIPAIIEVAA VLTMQDALRA LTDYWSRHGC TLIQPYNSEV GAGTLNPATV LRVLGPEPWR VAYVEPSVRP DDARYGENPN RLQTHTQFQV ILKPDPGDPQ ELYLGSLAAL GIDLHAHDVR FVEDNWASPA LGAWGLGWEV WLDGLEITQF TYFQQAGGIT LDPVPVEITY GIERIMMALQ RVDHFKEIRY ATGVSYGEIF GQAEYEMSRY YLDEASTDAN GRLFEEYAAE ARRMLDARLP VPAHNYVLKC SHTFNVLDSR GVVSTTERAA AFKRMRGLAR DVAHLWVERR AELGHPLGVV GPPAPAELPV STAVPGTPAT LVFEIGTEEL PAGDVTRTAA TVRDAVARKL AATRLGHGGI RTYATPRRVV TIVEQVQPRE PDSEQTVRGP RVSVAYADDG EPTKAAVGFA RGRQVDVAKL RRIDVDGVAH VGVVRPDPGR DAVAVLSELL GQVVAELRAD KNMRWNDPKL SFVRPLRWLL ALLGDVPVPL AVSALAGGTT TRVLRNAAEP SVEVPSADGY LELLAGEGIV LDPAERRTRI VRAAQRLARD VGGVVDVEGE SALIDEITNL VEEPTPIRGG FAEEYLALPS EILMTVMRKH QRYLPVRAAD GTLMPYFITV ANGSCDQEVV GTGNEAVLRA RYEDALFFWR ADLDVSPDRF RAALDKLSFA ENLGSMADRA DRIAAVAQDL GALPMVFMTE AERRTLTRAA ALAKFDLASH MVVELTSLAG TMAREYALRA GEPEPVAQAL HEMEQPRSAG GQLPRSTPGA LLALADRLDL LVGMFGIGSG PTGRSDPFGL RRAASGLVSI LRDHTALRSM DVSACLAVAA EHLRARAVDV PATTLDEIRE FVTRRYEQQL LDSGQDHRYV AAVLPLAGSP ARADETLAEL HKRAKAPDFA ELVAALLRVR RIVPAGTAAG YDASRLVEPC ELSLHQVLGE VRTALGEQPT RLRDFVDTAA ALVGPVNTFF DEVLVMDEDA DRRATRLGLL AHIRDLSAGV LDWSALGAMP AAGK
|
| |