Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_3704 |
Symbol | |
ID | 4597621 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 3931154 |
End bp | 3932254 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 639778312 |
Product | hypothetical protein |
Protein accession | YP_924891 |
Protein GI | 119717926 |
COG category | [S] Function unknown |
COG ID | [COG4129] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0978741 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGGCGG GGCCGCTGGA CCGGATGTGG GACCGGAGCC GGACCTCGGT CCGGGCTCGG ATCCGCCGGC TGCAGGCCAA GTCCTTCCAG ATCGCGCAGT GTGCGCTCGC CGCCGCGGTC GCCTGGTTGA TCGCGGCCGA CGTCCTGGGC CATCCCACCC CGTTCTTCGC CCCGGTGGCC GCCGTGGTCG CCCTCGGCAC CTCCTACGGC CAGCGGCTGC GCCGGGTCGC CGAGGTCACC ATCGGTGTGG CCGTCGGCGT CTTCCTCGCC GACCTGCTGG TGGTGTGGCT GGGCTCGGGC TGGTGGCAGA TCGGGCTGAT CGTCGCGCTC GCGATGGCGG CGGCCTTCCT GCTCGACGGC GGCCAGCTGT TCGTGACCCA GGCGGCGGTG CAGTCCATCG TGGTCGCGTC GCTCGTCCCG GACCCGGGGG CGGCGTTCAC CCGGTGGACC GACGCGGTCG TGGGTGGCGC GGTCGCACTG GTCGCGGCCA CCGCGGTGCC GGCCGCGCCG CTACGCCGGC CGCGCGAGCA GGCCGCGGTC GTGATGCGCA AGATCGCAGC CCTGCTGCGG GCGGCCGGTG AGGTGATGGA CGACGGGGAG GCGGCCCGGG CGCTCGACCT GCTCGCCGAC GCCCGCTCGA CCGACTACCT GATCCGCGAG CTCAAGGACG CCGCCGACGA AGGCCTCTCC GTGGTCGAGT CCTCGCCGTT CCGGATCCGC CACAAGGGCG ACCTGCGCAA GATGGTCGAC CTGGTCGACC CGCTCGACCG GGCGCTGCGC AGCACCCGCG TCCTGGTCCG GCACACCGCG GTCGCGGCGT ACCGCCGGCG CCCGGTGCCG TCGTCCTACT CGCTGCTCGC CCTCGACCTC GCCGACGCGG CCGACCGGGT TGCCGACGAG CTGGCCGCGA ACCGGATGGC GGCCGCTGCC CGAGGCGTCC TGCTCGTGGT GGGCGAGGCC ACCGGGCAGG TGGAGCGCAC CGACCTGCTC ACCGCCGAGG CGATCCTGGC CCAGCTGCGG GCGGTCGTCG CGGACCTCCT GATGGTCACC GGGCTCGACC AGATGGAGTC GACCGAGGCC CTGCCCCCGC CACCGCGATG A
|
Protein sequence | MEAGPLDRMW DRSRTSVRAR IRRLQAKSFQ IAQCALAAAV AWLIAADVLG HPTPFFAPVA AVVALGTSYG QRLRRVAEVT IGVAVGVFLA DLLVVWLGSG WWQIGLIVAL AMAAAFLLDG GQLFVTQAAV QSIVVASLVP DPGAAFTRWT DAVVGGAVAL VAATAVPAAP LRRPREQAAV VMRKIAALLR AAGEVMDDGE AARALDLLAD ARSTDYLIRE LKDAADEGLS VVESSPFRIR HKGDLRKMVD LVDPLDRALR STRVLVRHTA VAAYRRRPVP SSYSLLALDL ADAADRVADE LAANRMAAAA RGVLLVVGEA TGQVERTDLL TAEAILAQLR AVVADLLMVT GLDQMESTEA LPPPPR
|
| |