Gene Noca_4404 details

Gene Information

Locus tag: Noca_4404
Symbol:
ID: 4596922
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 4658649
End bp: 4661750
Gene Length: 3102 bp
Protein Length: 1033 aa
Translation table: 11
GC content: 71%
IMG OID: 639779014
Product: hypothetical protein
Protein accession: YP_925588
Protein GI: 119718623
COG category: [C] Energy production and conversion
COG ID: [COG0247] Fe-S oxidoreductase
TIGRFAM ID:
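
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive, that the CDS is unspliced, and that the gene length includes the stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(4658649, 4661750)
print(length)                     # 3102
print(protein_length_aa(length))  # 1033
```

Under those assumptions the result agrees with the listed gene length of 3102 bp and protein length of 1033 aa.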


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.326408
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACTGCAG TGCTCATCTT GGGTCTCGTG ATCAACGCGA TCGCCCTGCC GATCGCGGGG 
AAGCGGGCGC TGTTCCTCTA CCGACTGATC AGCAGCGGCC AGCCCGCGCC GGACCGGATC
GCGGGGGTGA CCCGGCGCAT CGGCGCCGCC GCCAAGCGGC AGGTGCTCGA GGTGTTCGGC
CAGCGCAAGA TGCTGAAGTG GACGGTGCCC GGCACCGCGC ACTTCTTCGT GATGTGGGCC
TTCTTCATCC TCGCCACCGT CTACCTCGAG GCGTACGCCG TCTTGTTCGC GCGCGACTCC
GGGTGGCACT GGTTCGTCTT CAACAGCTGG GGCGTCCTCG GCTTCCTGCA GGACTTCATC
GCCGTGATGT GCACGCTCGG CATCGTGGTC TTCTGGGCCA TCCGGCTGCG CAACCAGCCC
CAGCAGATGG GGCGCAAGTC CCGCTTCTTC GGCTCGCACC TCGGACCGGC GTACTTCACG
CTGTTCATGA TCTTCAACGT CATCTGGACG ATGTTCCTGT TCCGCGGGGC GGTCGAGTCC
CGCGACATGG GCACCGAGCA CGGCTACGGC AAGGCGGCGT TCGTCTCCTA CCTGCTCGGC
AAGGTGCTGC CGGACAGCAC CGCCCTGATC GGCATCGGCC TGCTCCTGCA CATCGGCGTG
ATGCTCGCGT TCTTGATCTT CGTGCTGAAC TCCAAGCACC TGCACATCTT CCTGGCGCCG
CTCAACGTGC TCTTCGGCCG CGAGCCGAAG GCGCTCGGTG CGGTCAAGCC GCTGATCTCG
GCGGGCAAGC CGGTCACGCT CGACGACATC GACGACCTCG ACGAGGACGC CAAGCTCGGC
GCCGGGGCGA TCGAGGACTT CACCTGGAAG GGCCTGCTCG ACATGGCGAC CTGCACCGAG
TGCGGTCGAT GCCAGTCGCA GTGCCCGGCC TGGAACACCG AGAAGCCGCT GTCGCCGAAG
CTGATGATCA TGGCGCTGCG CGACGCGTCG TTCGCGAAGG CGCCGTACCT CCTCGCCGAC
GAGGGCAAGC GCGCCGGCCT CCTCGAGGGC AGCGACACGC TCACCAAGGA GGTGGAGCGG
CCACTGGTCG GCGACACCGG TGACGAGTGG TTCTACATGC CCGAGGACGG CTCCGCGGTC
ATCGACCCCG ACGTGCTCTG GTCCTGCGTC ACCTGCGGCG CCTGCGTCGA GCAGTGCCCG
GTCGACATCG AGCACGTCGA CCACATCGTC GACATGCGCC GCTACCAGGT GCTGGTCGAG
TCGAACTTCC CCAGCGAGCT CAACCAGCTG TTCCGCGGCC TGGAGAACAA CGGCAACCCG
TGGAACATGT CGCCCAACGC GCGGCTGGAC TGGGCCAAGG GCCTGGACTT CGAGGTCAAG
GTCGTCGGCG AGACGATCGA GTCGCTCGAC GAGGTCGACT GGCTGTTCTG GGTCGGCTGC
GCCGGCGCGT ACGAGGACCG TGCGAAGAAG ACGACCCGCG CGGTCGCCGA GCTGCTCGAC
ATCGCCGGGG TGAGCTTCGG CGTCCTCGGC AACGGGGAGA CCTGCACCGG CGACCCGGCC
CGGCGCGCCG GCAACGAGTT CGTCTTCCAG GGCCTCGCCC AGCAGAACGT CGAGACGTTC
AAGGAGACCC GGGTCAAGAA GGTCGTCTCG ACCTGCGCCC ACTGCTTCAA CACGCTCAAG
AACGAGTACA AGGAGTTCGG CATCGAGCTC GAGGTCGTGC ACCACACCCA GCTGCTCAAC
CGGCTGGTGC GCGAGGGCAA GCTGACCCCG ATCCGTGACG GTGCCGGCGC GCACAAGCGC
AAGATCACCT ACCACGACCC GTGCTACATC GGCCGCCACA ACGGCGTCTA CGCACCGCCC
CGCGAGCTGC TGCAGGTGCT GCCCGGCGCC GAGGTCGTCG AGATGGAGCG CAACTCCGAG
CGGTCCTTCT GCTGCGGTGC CGGCGGCGCG CGGATGTGGA TGGAGGAGAC GATCGGCGAG
CGGATCAACG AGAACCGCAC CGCCGAGGCC GTCGGCACCG GCGCCGACCA GATCGCGGTC
GGGTGCCCGT TCTGCCGGGT GATGCTCTCC GACGGGCTCA CCGCCCAGCA GGACAAGGGC
GCCGCCCGTG CCGAGGTCGA GGTCCTCGAC GTCGCGCAGA TGCTGCTCGC CTCGGTCAAG
GGCGAGATGG CGACCCGGCA TGCCCCCGGC TCCCTGGCCG CCGCGGCACC CGCCGCTCGC
TCGGAGGAGA CGAAGGCCGA GCCGGAGCCC GGCGACGCCA CCCAGACGGC CGACACCGTC
ACCGAGACCG CCGACGTCGG GCCGGCCGCG AAGGCCTCCG GCGGGTCGTC GCTGTTCGAC
ACCCCGGCCG ACTCGGCGAC CGCCACCGAG GACAGCTCCG TCGCGGAGGA GGCCCGGGCC
GCCAAGCCGG CGTCCTCGGG CGGTTCGCTG TTCGACCTCG GCGGGGACAC CGACTCCACC
GTCGCGGCGA AGCCGACCCC CGAGGCGCAG ACCGAGGTGG CCAACACCGG CAGCGACACC
GCGACCACCG GGTCGCTGTT CGACCTCGCC GGCGACCAGC CGGCCGAGCA GCCGAAGGCG
GCCGAGCCGG ACGCGAAGCC CGAGACGGAG TCGAAGGAGG AGGCTCCCGA GCCGCCGGCC
GCGACGACCC CGGCCGCCGG CGCGGACCTC GGATCCGGCT CGCTGTTCGA CATCGTCGCC
GACGAGCCGG CCGCCTCGGC GCCGAAGGCC GCCGAGCCCG AGCCCAAGGC CGAGCCCGCG
CGGCCCGAGC CCGCCGCACC CGCGGCCGAG GCGAAGCCCG AGCTCGACCT GAGCTCGGGT
GGCTCGCTGT TCGACATCGC GGCCCCCGAC CCGCAGGAGC TCAGTGCTTC GGCCACCGCG
GCCGCCAACG CCGCGTCCGG CGCGACGCCT GAGCCGGAGG CCGCGGCCGA GCCGGAGGTC
GAGGAGCCGG AGGTCGAGGA GCCCGAGGAG CCCGAGGCCG CCGCACCCGC GGCCGAGGAG
CAGCAGTCCG AGGAGAAGCC GAAGCCGCCG GCCGGCGGCG CGGCGCACCA GCCGAAGACC
GACGTCGACA TCCACGAGAC CGGCTCGCTC TTCGACCTCT AG
 
Protein sequence
MTAVLILGLV INAIALPIAG KRALFLYRLI SSGQPAPDRI AGVTRRIGAA AKRQVLEVFG 
QRKMLKWTVP GTAHFFVMWA FFILATVYLE AYAVLFARDS GWHWFVFNSW GVLGFLQDFI
AVMCTLGIVV FWAIRLRNQP QQMGRKSRFF GSHLGPAYFT LFMIFNVIWT MFLFRGAVES
RDMGTEHGYG KAAFVSYLLG KVLPDSTALI GIGLLLHIGV MLAFLIFVLN SKHLHIFLAP
LNVLFGREPK ALGAVKPLIS AGKPVTLDDI DDLDEDAKLG AGAIEDFTWK GLLDMATCTE
CGRCQSQCPA WNTEKPLSPK LMIMALRDAS FAKAPYLLAD EGKRAGLLEG SDTLTKEVER
PLVGDTGDEW FYMPEDGSAV IDPDVLWSCV TCGACVEQCP VDIEHVDHIV DMRRYQVLVE
SNFPSELNQL FRGLENNGNP WNMSPNARLD WAKGLDFEVK VVGETIESLD EVDWLFWVGC
AGAYEDRAKK TTRAVAELLD IAGVSFGVLG NGETCTGDPA RRAGNEFVFQ GLAQQNVETF
KETRVKKVVS TCAHCFNTLK NEYKEFGIEL EVVHHTQLLN RLVREGKLTP IRDGAGAHKR
KITYHDPCYI GRHNGVYAPP RELLQVLPGA EVVEMERNSE RSFCCGAGGA RMWMEETIGE
RINENRTAEA VGTGADQIAV GCPFCRVMLS DGLTAQQDKG AARAEVEVLD VAQMLLASVK
GEMATRHAPG SLAAAAPAAR SEETKAEPEP GDATQTADTV TETADVGPAA KASGGSSLFD
TPADSATATE DSSVAEEARA AKPASSGGSL FDLGGDTDST VAAKPTPEAQ TEVANTGSDT
ATTGSLFDLA GDQPAEQPKA AEPDAKPETE SKEEAPEPPA ATTPAAGADL GSGSLFDIVA
DEPAASAPKA AEPEPKAEPA RPEPAAPAAE AKPELDLSSG GSLFDIAAPD PQELSASATA
AANAASGATP EPEAAAEPEV EEPEVEEPEE PEAAAPAAEE QQSEEKPKPP AGGAAHQPKT
DVDIHETGSL FDL