Gene Noca_3089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3089 
Symbol 
ID4597874 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3289614 
End bp3291734 
Gene Length2121 bp 
Protein Length706 aa 
Translation table11 
GC content55% 
IMG OID639777695 
Producthypothetical protein 
Protein accessionYP_924278 
Protein GI119717313 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.679404 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAAACA TATCCGAGCG CAACTTGAAG GCGTTTCATG AGGCTGCTCA GTCCTTGAAA 
CTGTACAGAC GCGCGGAGCT TTCGGACGCT CGCGGGGAGT CGCTAGTCGA GCATTTGTAC
GTGGATCCCT TGCCGAATGA CCACGTTCTC CAGACGATGT TGAAGAGCAA CACTACTTTC
CTGGTCGGGC GGAAGGGCAC TGGCAAATCG ACGGTCTTCC AGCGCGCGCA ACATGAGTTG
CGGAAAGCTG GAAAAGCGTC CGCATACATC GACATCAAGA CCATCTACGA ATCCAGCCAG
GTCGATCCGG GTCTGGCCGC GAAGCTATCG GCAATGCCGG GTGCCATGAA CTTGGACGAG
ATGGAGTCGT TGCTTCTCCA TCGAGCATTC TTGGCCGCGA TGATCCGAGC AATCCGCGAA
GAGATCAAGA GCCGAATCAA GGTCTCTCTA TGGCAGAAAG TTAAGAACTC GGTAGGTGGA
TCCGTGGATG ATCTGTTTGA GGGCTTGGAT GCCCTCCTTG AGGCAGCTGA CGCGGACCGC
TTTATCAGTG TTGTTTCCAC CCGTTTAGTC GAAGAGCGCA GGCAGGACTC CGACGAGGTC
ATCAACAAGC AAGGTGTGTC CGGAAAGGTT GGCGCGGGTG CCACAGGGCC CGAGGCCTCC
CTTGCGTTTG AAGACCAAAC GAGCTACACG TTCGGGTCGG AGCGTGAACG GCAGTTTGCG
GATATCTTGG TTCGCGTCTT CGACATCAAG TCGTATATAC TGCAACTCAA GGAGTTGCTG
AATCGAGTCG GACTGACACA GTTGTTCGTC TTCGTAGACG ATTTTTCGGA GCTTCCGCCC
CCCGCCATGC AGGTCGTCGT AGACGCTCTC CTCGGGCCCT TGAACAATTG GTCTGAAGAG
TTGATTAAGT TCAAGATCGC TGCCTATCCC GGCCGCATCT ACTACGGCCA AATCGATAAG
ACGAAGATTG ATGAGATCAG CCTCGACATC TACAGCCTGT ACGGCACCGC CGACATCGCG
GGGATGGAAG AGAAGGCGGT CGACTTCACG CGCCGTCTGA TCACGCGGCG TCTCGAGCAC
TTTGGTGTTC AAGCGCTCGG CGATGTCTTC GACACGCGTC GCTCCGGTGG AGATGAATTG
TGGCGTCAAT TGTTCTTCGC CACCATGGCA AACCCCCGTA ATATTGGCTA CGTATTGTAT
TTCATGTACG AGTCGAATCT CATCTATGGG CAGAAAATCA ATCTTGCCTC GATCCGGGAC
GCGGCGCGTA GGTACTACGA GGAAAAGATT GAGGCGTATT TCGCCATGAA TCGATTCCTG
CACGAGAGCT TCGATGAACG ATCTTCCATC TTTAGTCTCA AAGAACTACT CGACGCCCTC
GTCATCCGAG CGCGTGAGTT GCGGTATCAC ACCACCTCGG CAGTAATGCG AGACCTCGAA
GGTCGCCCAC CTACGAGCCA TTTCCACGTT GCGACGCAAA TGGAGGCTCT GCTCGCAACC
TTGGAGTTGA ACTTCTTCAT CACCAAGTAC TATGTGATGA GTGACCGAGA CGGTCGGAAG
GTCACCGTTT ACGCCCTTAA CTACGGGCTA TGTCAAAAGC AGAGCATTGA GTTCGGACGT
CCTGAGGGGA AGCGCGAGTA CCGGCTGTAC TACGTGGAAC GAGTGTTCGA TGTCACTGCG
ATCCTCCAAG AGTACGTCAG AATGAATCAG GAAATCGTCT GCGACTCATG CGGCCACAAC
TTCGATCATG AAGCGCTTCC CGCCCTACAA ATGTTTGACA TGCTTTGTCC AGAATGCAGA
ACGGGCAAGT GTGTCATTAC GAACCTCTCG AAGAAGTACG AGGCCACCCT TCGAGAGATT
GATCCTGAAC TTCTCCTGCC AAAGGTTGAG CTTGGGATTC TTCACACACT TGGAACCGAA
GAGCGGGCTC TTTATGCCGC CGAGGTCGCT GGCGAACTGG ACTGCTCGTA TCAACTCGTT
GGGAAGCGGG CGGTCAACTT GGCCGAGAGA GGCTTGGTGG ATCGTGACAA ATTCGACCAA
GGGCGCCGCG AGCTGAGGGT TACTGAGTTG GCTCAGCGAA CGTACTTGCG CGATCTAGTC
GAGGACGTCG AGCCGGATTG A
 
Protein sequence
MPNISERNLK AFHEAAQSLK LYRRAELSDA RGESLVEHLY VDPLPNDHVL QTMLKSNTTF 
LVGRKGTGKS TVFQRAQHEL RKAGKASAYI DIKTIYESSQ VDPGLAAKLS AMPGAMNLDE
MESLLLHRAF LAAMIRAIRE EIKSRIKVSL WQKVKNSVGG SVDDLFEGLD ALLEAADADR
FISVVSTRLV EERRQDSDEV INKQGVSGKV GAGATGPEAS LAFEDQTSYT FGSERERQFA
DILVRVFDIK SYILQLKELL NRVGLTQLFV FVDDFSELPP PAMQVVVDAL LGPLNNWSEE
LIKFKIAAYP GRIYYGQIDK TKIDEISLDI YSLYGTADIA GMEEKAVDFT RRLITRRLEH
FGVQALGDVF DTRRSGGDEL WRQLFFATMA NPRNIGYVLY FMYESNLIYG QKINLASIRD
AARRYYEEKI EAYFAMNRFL HESFDERSSI FSLKELLDAL VIRARELRYH TTSAVMRDLE
GRPPTSHFHV ATQMEALLAT LELNFFITKY YVMSDRDGRK VTVYALNYGL CQKQSIEFGR
PEGKREYRLY YVERVFDVTA ILQEYVRMNQ EIVCDSCGHN FDHEALPALQ MFDMLCPECR
TGKCVITNLS KKYEATLREI DPELLLPKVE LGILHTLGTE ERALYAAEVA GELDCSYQLV
GKRAVNLAER GLVDRDKFDQ GRRELRVTEL AQRTYLRDLV EDVEPD