Gene Information |
Locus tag | Noca_3804 |
Symbol | |
ID | 4599027 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 4019544 |
End bp | 4020641 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 639778412 |
Product | hypothetical protein |
Protein accession | YP_924991 |
Protein GI | 119718026 |
COG category | [S] Function unknown |
COG ID | [COG4129] Predicted membrane protein |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTAGGGC GTGGGCTCGG TGGGCCCCGT GACCCGCCCG GCGTCGCCGG GTACGGGTCC CCCATGAACA CCACCACCGC CGCACGCCGG GCCCGGGCCG GGGCCCGGGC GCGTCCGCAC TTCTTCATGG CCATCAAGGC GGCTCTCGCC GCGATGCTGG CCTGGCTCGT GGTGCAGCCG CTCGGCGGGT TCGCCGCGGA GTACTCCTTC TACGCCCCGC TCGGCGCCCT GGCCGTCGTC TCCACCTCCG TGGTGCGCTC CGCCCGGAGC GCGCTCGAGG TCGTGCTCGC GATCCTCCTC GGCTCGGTGC TCGCGCTCAT CGCCATCGGT GCTCCGCTGC CCGAGCCGAT CCCCCTCGGG CTGGCCGTGG TGGCGGGCGT GATCGTCGGC GGTGCGGGCC TCGCCGGCGG GATGGGCAGC TGGGTGCCGC TGGCCGCCAT GTTCGTGCTG GTCGCGGGCA AGGGCGACCC GATCGAGTAC ACCGCGGCGT ACGGAGGCCT GACCGCCCTC GGCGCCGTGG TCGGCGTCGG GGTCTACGTC GCGTTCCCCC AGCTGCCGCT CACCCCGGCC GCCCTCGCCC AGGAGCGGCT CCGCACGGAG CTCGCCGACC ACCTCGACGA GCTCGCCACG GCGCTCGAGC GGGAGATCGT CGGCGAACGG GACTGGAACT CGCTGCGGCA CTCCCTGGCC GGTACGGCGC GTGACGCCGA CGGCCTGATG GACGAGGCCC GCGACGCCAG GCGCGCCAAC TGGAAGGCGG CCCGCTGGGC CGAGAGCACC GAGCGGCACG ACCTGCGCGC TCAGGCCCTC CAGCGGCTCA CCGGCTGCGT CGACGAGGTC ATCGCCCTGG TCTCCGACCA GCGTGCGGAG ATCCGCCACG ACGACCCCGC CGCCGCCTCG CTCCGGGCGA GCACCGCCCA GGCCCTGCGG TGCGTCGCCG CCCTGCTTCG GGAGGAGGCC GATCCCGACG CAGCCCGCGC GGACGTCTCG GCCCTCCGCT CCCGCGTCGT CCAGGCCCAG AGCAGCACCG GCGACCACCA CTTCGCCGCC GCCGCGATCG TGCTCAACCT CGAGCAGGCC GTCGAAGCCT GGACCTGA
|
Protein sequence | MLGRGLGGPR DPPGVAGYGS PMNTTTAARR ARAGARARPH FFMAIKAALA AMLAWLVVQP LGGFAAEYSF YAPLGALAVV STSVVRSARS ALEVVLAILL GSVLALIAIG APLPEPIPLG LAVVAGVIVG GAGLAGGMGS WVPLAAMFVL VAGKGDPIEY TAAYGGLTAL GAVVGVGVYV AFPQLPLTPA ALAQERLRTE LADHLDELAT ALEREIVGER DWNSLRHSLA GTARDADGLM DEARDARRAN WKAARWAEST ERHDLRAQAL QRLTGCVDEV IALVSDQRAE IRHDDPAAAS LRASTAQALR CVAALLREEA DPDAARADVS ALRSRVVQAQ SSTGDHHFAA AAIVLNLEQA VEAWT
|
| |
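The derived fields in the record (Gene Length, Protein Length, GC content) can be cross-checked against the coordinates and the sequence. The following is a minimal Python sketch, not the database's own method. The function names are hypothetical, and it assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases; spaces between 10-base blocks are ignored."""
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# Values taken from the record above (Start bp, End bp).
length = gene_length(4019544, 4020641)
print(length)                  # 1098
print(protein_length(length))  # 365
```

Passing the full "Gene sequence" string to `gc_fraction` gives a value to compare with the reported GC content, which the record rounds to a whole percent.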