Gene Information |
Locus tag | Noca_3907 |
Symbol | |
ID | 4598042 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 4112019 |
End bp | 4113107 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 639778513 |
Product | hypothetical protein |
Protein accession | YP_925092 |
Protein GI | 119718127 |
COG category | [A] RNA processing and modification |
COG ID | [COG5178] U5 snRNP spliceosome subunit |
TIGRFAM ID | |
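The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4112019, 4113107)
print(length_bp, protein_length(length_bp))  # prints: 1089 362
```

Under these assumptions the result agrees with the Gene Length (1089 bp) and Protein Length (362 aa) fields listed for Noca_3907.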
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.837728 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCGGGCG ACCAGTACCC TCCCCCGGGC GGATCGCCCC CGAGCCCGCC CCCGCCCGGA TGGCAGCCAC CGCCGCCCCC GACACCCGTC CCGCCGGCGC CCGGCTGGGC GCAGTCGCCA GTCGTCGCGC CGGGCATGCT GGGCGCCGCA CACAAGCCGG GTGCGATGCC GCTGCGCCCG CTCGCGCTCG GCGACATGTA CGACGCGGCG TTCCGGATCA TCCGGTTCAA CCCGAAGGCC ACGGTCGGCT CCGCGGTGCT CGTCGCGGCG GTCGCGATGG GCGTCCCGGT CCTGGTCACC GCGCTGCTGA CGCTGGTCGT GGATCTCTCC GCCGCACAGT CCGGCGACGA CCTCTCCACT GCCGAGGTCG TCGGCTACGC CGGGTCGATC GGCTCGCTGT TCCTCGGCAC CGCGCTGCAG TCGGTGGGCT CGATCCTGGT CACCGGCATG GTCGCCCACG TGACGGCGGC CGCGGCGATC GGCCGCCGGC TCACCCTCGG CGAGGCCTGG GCCGCGACCC GGGGCTCGCG CTGGCGGCTG GTCGGGCTGA CACTGCTGCT GGGCCTGATG CTGGCCGGGC TGCTGCTCGC CTACGGCCTG CTCTGGATCC TGGCGGTCGT GCTGCTGCCG ACCTGGGCGA TCGTCCTGTT CGGCGTGCTC AGCGTCCCGG CGTTCCTCGC GTTCGCCTGC TGGTTCTGGA TCCGGGTCTA CTACCTGCCG GTGCCGGCCC TGATGATCGA GCGGACCCGG GTGCTGGCCG CGATCGGGCG GGGCTTCCGG CTGACCCGGC GGCAGTTCTG GCGGACCTTC GGGATCGCGC TGCTGACCGT GATCGTCACG GGGATCGCCG GGAGCATGCT GTCGGCGCCG TTCACGATCG CCGCTCAGCT GCTGCCGCTG GCGATGGCCG AGTCCCGCTA CGCCGTACTC GTCCTGGTCG TGCTGAGCGC GATCGCCACC GTCGTCCAGA CCGCGTTCGT CACCCCCTTC ACCTCCGCGG TCACGAGCGT GCAGTACCTC GACCAGCGGA TCCGCAAGGA GGCCTACGAC GTCGAGCTGA TGACCCAGGC CGGGATCACC GCGTCGTGA
|
Protein sequence | MSGDQYPPPG GSPPSPPPPG WQPPPPPTPV PPAPGWAQSP VVAPGMLGAA HKPGAMPLRP LALGDMYDAA FRIIRFNPKA TVGSAVLVAA VAMGVPVLVT ALLTLVVDLS AAQSGDDLST AEVVGYAGSI GSLFLGTALQ SVGSILVTGM VAHVTAAAAI GRRLTLGEAW AATRGSRWRL VGLTLLLGLM LAGLLLAYGL LWILAVVLLP TWAIVLFGVL SVPAFLAFAC WFWIRVYYLP VPALMIERTR VLAAIGRGFR LTRRQFWRTF GIALLTVIVT GIAGSMLSAP FTIAAQLLPL AMAESRYAVL VLVVLSAIAT VVQTAFVTPF TSAVTSVQYL DQRIRKEAYD VELMTQAGIT AS
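The sequences above are printed in space-separated blocks of 10 characters, so the blocks need to be joined before any analysis. The following is a minimal Python sketch for joining them and computing a GC percentage. The helper names are hypothetical, and the input is only the first two blocks of the gene sequence, not the full record.

```python
def join_blocks(blocked: str) -> str:
    """Remove the whitespace between 10-character display blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First two display blocks of the gene sequence, used as a short example.
seq = join_blocks("GTGTCGGGCG ACCAGTACCC")
print(len(seq), gc_percent(seq))
```

Applying the same two helpers to the complete gene sequence would give a value that can be compared with the GC content field of the record.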