Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_4404 |
Symbol | |
ID | 4596922 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 4658649 |
End bp | 4661750 |
Gene Length | 3102 bp |
Protein Length | 1033 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639779014 |
Product | hypothetical protein |
Protein accession | YP_925588 |
Protein GI | 119718623 |
COG category | [C] Energy production and conversion |
COG ID | [COG0247] Fe-S oxidoreductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.326408 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACTGCAG TGCTCATCTT GGGTCTCGTG ATCAACGCGA TCGCCCTGCC GATCGCGGGG AAGCGGGCGC TGTTCCTCTA CCGACTGATC AGCAGCGGCC AGCCCGCGCC GGACCGGATC GCGGGGGTGA CCCGGCGCAT CGGCGCCGCC GCCAAGCGGC AGGTGCTCGA GGTGTTCGGC CAGCGCAAGA TGCTGAAGTG GACGGTGCCC GGCACCGCGC ACTTCTTCGT GATGTGGGCC TTCTTCATCC TCGCCACCGT CTACCTCGAG GCGTACGCCG TCTTGTTCGC GCGCGACTCC GGGTGGCACT GGTTCGTCTT CAACAGCTGG GGCGTCCTCG GCTTCCTGCA GGACTTCATC GCCGTGATGT GCACGCTCGG CATCGTGGTC TTCTGGGCCA TCCGGCTGCG CAACCAGCCC CAGCAGATGG GGCGCAAGTC CCGCTTCTTC GGCTCGCACC TCGGACCGGC GTACTTCACG CTGTTCATGA TCTTCAACGT CATCTGGACG ATGTTCCTGT TCCGCGGGGC GGTCGAGTCC CGCGACATGG GCACCGAGCA CGGCTACGGC AAGGCGGCGT TCGTCTCCTA CCTGCTCGGC AAGGTGCTGC CGGACAGCAC CGCCCTGATC GGCATCGGCC TGCTCCTGCA CATCGGCGTG ATGCTCGCGT TCTTGATCTT CGTGCTGAAC TCCAAGCACC TGCACATCTT CCTGGCGCCG CTCAACGTGC TCTTCGGCCG CGAGCCGAAG GCGCTCGGTG CGGTCAAGCC GCTGATCTCG GCGGGCAAGC CGGTCACGCT CGACGACATC GACGACCTCG ACGAGGACGC CAAGCTCGGC GCCGGGGCGA TCGAGGACTT CACCTGGAAG GGCCTGCTCG ACATGGCGAC CTGCACCGAG TGCGGTCGAT GCCAGTCGCA GTGCCCGGCC TGGAACACCG AGAAGCCGCT GTCGCCGAAG CTGATGATCA TGGCGCTGCG CGACGCGTCG TTCGCGAAGG CGCCGTACCT CCTCGCCGAC GAGGGCAAGC GCGCCGGCCT CCTCGAGGGC AGCGACACGC TCACCAAGGA GGTGGAGCGG CCACTGGTCG GCGACACCGG TGACGAGTGG TTCTACATGC CCGAGGACGG CTCCGCGGTC ATCGACCCCG ACGTGCTCTG GTCCTGCGTC ACCTGCGGCG CCTGCGTCGA GCAGTGCCCG GTCGACATCG AGCACGTCGA CCACATCGTC GACATGCGCC GCTACCAGGT GCTGGTCGAG TCGAACTTCC CCAGCGAGCT CAACCAGCTG TTCCGCGGCC TGGAGAACAA CGGCAACCCG TGGAACATGT CGCCCAACGC GCGGCTGGAC TGGGCCAAGG GCCTGGACTT CGAGGTCAAG GTCGTCGGCG AGACGATCGA GTCGCTCGAC GAGGTCGACT GGCTGTTCTG GGTCGGCTGC GCCGGCGCGT ACGAGGACCG TGCGAAGAAG ACGACCCGCG CGGTCGCCGA GCTGCTCGAC ATCGCCGGGG TGAGCTTCGG CGTCCTCGGC AACGGGGAGA CCTGCACCGG CGACCCGGCC CGGCGCGCCG GCAACGAGTT CGTCTTCCAG GGCCTCGCCC AGCAGAACGT CGAGACGTTC AAGGAGACCC GGGTCAAGAA GGTCGTCTCG ACCTGCGCCC ACTGCTTCAA CACGCTCAAG AACGAGTACA AGGAGTTCGG CATCGAGCTC GAGGTCGTGC ACCACACCCA GCTGCTCAAC CGGCTGGTGC GCGAGGGCAA GCTGACCCCG ATCCGTGACG GTGCCGGCGC GCACAAGCGC AAGATCACCT ACCACGACCC GTGCTACATC GGCCGCCACA ACGGCGTCTA CGCACCGCCC CGCGAGCTGC TGCAGGTGCT GCCCGGCGCC GAGGTCGTCG AGATGGAGCG CAACTCCGAG CGGTCCTTCT GCTGCGGTGC CGGCGGCGCG CGGATGTGGA TGGAGGAGAC GATCGGCGAG CGGATCAACG AGAACCGCAC CGCCGAGGCC GTCGGCACCG GCGCCGACCA GATCGCGGTC GGGTGCCCGT TCTGCCGGGT GATGCTCTCC GACGGGCTCA CCGCCCAGCA GGACAAGGGC GCCGCCCGTG CCGAGGTCGA GGTCCTCGAC GTCGCGCAGA TGCTGCTCGC CTCGGTCAAG GGCGAGATGG CGACCCGGCA TGCCCCCGGC TCCCTGGCCG CCGCGGCACC CGCCGCTCGC TCGGAGGAGA CGAAGGCCGA GCCGGAGCCC GGCGACGCCA CCCAGACGGC CGACACCGTC ACCGAGACCG CCGACGTCGG GCCGGCCGCG AAGGCCTCCG GCGGGTCGTC GCTGTTCGAC ACCCCGGCCG ACTCGGCGAC CGCCACCGAG GACAGCTCCG TCGCGGAGGA GGCCCGGGCC GCCAAGCCGG CGTCCTCGGG CGGTTCGCTG TTCGACCTCG GCGGGGACAC CGACTCCACC GTCGCGGCGA AGCCGACCCC CGAGGCGCAG ACCGAGGTGG CCAACACCGG CAGCGACACC GCGACCACCG GGTCGCTGTT CGACCTCGCC GGCGACCAGC CGGCCGAGCA GCCGAAGGCG GCCGAGCCGG ACGCGAAGCC CGAGACGGAG TCGAAGGAGG AGGCTCCCGA GCCGCCGGCC GCGACGACCC CGGCCGCCGG CGCGGACCTC GGATCCGGCT CGCTGTTCGA CATCGTCGCC GACGAGCCGG CCGCCTCGGC GCCGAAGGCC GCCGAGCCCG AGCCCAAGGC CGAGCCCGCG CGGCCCGAGC CCGCCGCACC CGCGGCCGAG GCGAAGCCCG AGCTCGACCT GAGCTCGGGT GGCTCGCTGT TCGACATCGC GGCCCCCGAC CCGCAGGAGC TCAGTGCTTC GGCCACCGCG GCCGCCAACG CCGCGTCCGG CGCGACGCCT GAGCCGGAGG CCGCGGCCGA GCCGGAGGTC GAGGAGCCGG AGGTCGAGGA GCCCGAGGAG CCCGAGGCCG CCGCACCCGC GGCCGAGGAG CAGCAGTCCG AGGAGAAGCC GAAGCCGCCG GCCGGCGGCG CGGCGCACCA GCCGAAGACC GACGTCGACA TCCACGAGAC CGGCTCGCTC TTCGACCTCT AG
|
Protein sequence | MTAVLILGLV INAIALPIAG KRALFLYRLI SSGQPAPDRI AGVTRRIGAA AKRQVLEVFG QRKMLKWTVP GTAHFFVMWA FFILATVYLE AYAVLFARDS GWHWFVFNSW GVLGFLQDFI AVMCTLGIVV FWAIRLRNQP QQMGRKSRFF GSHLGPAYFT LFMIFNVIWT MFLFRGAVES RDMGTEHGYG KAAFVSYLLG KVLPDSTALI GIGLLLHIGV MLAFLIFVLN SKHLHIFLAP LNVLFGREPK ALGAVKPLIS AGKPVTLDDI DDLDEDAKLG AGAIEDFTWK GLLDMATCTE CGRCQSQCPA WNTEKPLSPK LMIMALRDAS FAKAPYLLAD EGKRAGLLEG SDTLTKEVER PLVGDTGDEW FYMPEDGSAV IDPDVLWSCV TCGACVEQCP VDIEHVDHIV DMRRYQVLVE SNFPSELNQL FRGLENNGNP WNMSPNARLD WAKGLDFEVK VVGETIESLD EVDWLFWVGC AGAYEDRAKK TTRAVAELLD IAGVSFGVLG NGETCTGDPA RRAGNEFVFQ GLAQQNVETF KETRVKKVVS TCAHCFNTLK NEYKEFGIEL EVVHHTQLLN RLVREGKLTP IRDGAGAHKR KITYHDPCYI GRHNGVYAPP RELLQVLPGA EVVEMERNSE RSFCCGAGGA RMWMEETIGE RINENRTAEA VGTGADQIAV GCPFCRVMLS DGLTAQQDKG AARAEVEVLD VAQMLLASVK GEMATRHAPG SLAAAAPAAR SEETKAEPEP GDATQTADTV TETADVGPAA KASGGSSLFD TPADSATATE DSSVAEEARA AKPASSGGSL FDLGGDTDST VAAKPTPEAQ TEVANTGSDT ATTGSLFDLA GDQPAEQPKA AEPDAKPETE SKEEAPEPPA ATTPAAGADL GSGSLFDIVA DEPAASAPKA AEPEPKAEPA RPEPAAPAAE AKPELDLSSG GSLFDIAAPD PQELSASATA AANAASGATP EPEAAAEPEV EEPEVEEPEE PEAAAPAAEE QQSEEKPKPP AGGAAHQPKT DVDIHETGSL FDL
|
| |