Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_1997 |
Symbol | |
ID | 4598313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 2136688 |
End bp | 2137848 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 639776601 |
Product | UspA domain-containing protein |
Protein accession | YP_923194 |
Protein GI | 119716229 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.297843 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGAGCG AGACGATCCC GGGCGGGACC ATCGTCGTGG GCGTCGACCA GACCCCCGCC GCGAGGGACG CGCTGGCCTG GGCGATCCAG GAGGCCGTGC ACGAGGAGAC GCCGCTGACG CTGGTCCACG GCGTCGGTGC GGCGCTGTCC GCGTGGGTCG ACTTCTCGAC GATCGACTTC CGCGACGTGA TCGACGCGAC CACCGCGACC GGGCGGGAGG TCCTCGACGC CGCGCAGGCC CAGGTCGACC GGGTCGCGCC CGCGGTCGAG GTGCACCACG TGCTCCGCCT GACCGACCCG CACCAGGCGC TGCTCGACCT CTCCGAGGAC GCGGCGATGC TGGTCGTGGG GTCCCGGACC GCGCCGCACG AGCGCGAGTC GCTGATGTCC TCGGCGCTCG GCTTCGTCCG CGACCGTGCC GGCTGCCCGG TGGTGGTCCC GCGGCTGCGC CGCAGCCCCG GCGTCGGCGT GGTCGTGCTC TGCGACGGCT CGCCGGAGTC CCAGTCGATC CTCGCCTACG CGTGGGGCCA GGCCGACCGC CGCGGCCTCC CGCTGACCGT CGTGCACTGC CTGCCCGACC CGCCGGCCGA CCTGCACCCC CGCGACATCA GCCTGATGGA CGCAACGGCT CGCATGGAGG CCAGCCTCGG GCCCTCGCTG CTCGGCTTCG AGCGGCTGCT GCTCAGCGAC CTGGTCCGTG ACATGCGGTC GCGCTGGCCG GGCGTGGACG TCCGCCTGGT CGTCGAGGAC GACGCGATCG ACAGGTGGCT CGAGCGCGCC CGGCAGCAGG CCGACCTGCT CGTGGTCGGC GCCAGGCACG CCCGCCGGCT GTCCGAGCTG GTCATCGGCA GCTCCACGCC GGAGGAGGTG GAGTGCGTCA CCGTCGTCCT GCCGTTGGAG GACCGGCTCG ATCCCGACGC GGACGCGGCG CGGGTGACCA TCCAGCGGCT CCATGGCTGT GCCCAGCGGC TGGTCCGGCT CAACATCCCC CTCGACCAAG CGGTGGCCGA GATCCGCTCG GTCACCATGG ACACCGACCT GCTCGCCGAG GCCGCCCTCA CCGCGCTGCG CGGCTGGGGC GCCACGACCG CCAAGAGCTG GCAGACCCGC GAGGTCACCG AGCTGCTGGT ACGGGCCGGA GCCCGCCGCG TCTGGCCCTG A
|
Protein sequence | MESETIPGGT IVVGVDQTPA ARDALAWAIQ EAVHEETPLT LVHGVGAALS AWVDFSTIDF RDVIDATTAT GREVLDAAQA QVDRVAPAVE VHHVLRLTDP HQALLDLSED AAMLVVGSRT APHERESLMS SALGFVRDRA GCPVVVPRLR RSPGVGVVVL CDGSPESQSI LAYAWGQADR RGLPLTVVHC LPDPPADLHP RDISLMDATA RMEASLGPSL LGFERLLLSD LVRDMRSRWP GVDVRLVVED DAIDRWLERA RQQADLLVVG ARHARRLSEL VIGSSTPEEV ECVTVVLPLE DRLDPDADAA RVTIQRLHGC AQRLVRLNIP LDQAVAEIRS VTMDTDLLAE AALTALRGWG ATTAKSWQTR EVTELLVRAG ARRVWP
|
| |