Gene Noca_4403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4403 
Symbol 
ID4596921 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4656072 
End bp4658501 
Gene Length2430 bp 
Protein Length809 aa 
Translation table11 
GC content72% 
IMG OID639779013 
Producthypothetical protein 
Protein accessionYP_925587 
Protein GI119718622 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGAGAGA AGAGTCCCGG CCGCCGGCTG GCGTGGGCCC TGGTCGCCGC CGCCGTGGTG 
GTCGCCACCG CGATCTCGAT GGGCGGGGCA GCGGGGCTGC GGCCCGGCGG CCTGGGTGAG
CGCTATGCGT TCGCCCAGGA CGCGCTCGGC CAGCGGAGCG TCGACAAGAA CGCGATCTCC
CGGGCGGGCG ACCTCGGCGA CCTCACCCGG GACCCCACCA CCGCGGCCGA GAAGGCCGCG
GCCACGGCGT ACGTCGACCA CGAGCGCGCG CTGCCCGACC CCGAGCTGAC CACGGCGCCG
ATCATCGGCG CGCGACACCG GCACCCGCAG GACCGCTACG CCCTGGCCGG CGGCTGCTAC
ACGCTGACTC CCGACGGTGA CCCGTTCTAC TTCAAGGCCA CCGACCTCGG CAGCTACCTG
ATCTACGACG CCGACCGGGC GTTCCTGCTC GCCGACGGCA ACCGCGCCGG CGAGCCGAGC
GCCGACACCG TCTGGGACAT CTCCCGGTCC GCCGGCGGGC GCTACTCCTT CCGCAACGAC
ACCGGCGCGC TGACCCTCGG CAGCACGACG GCGTTCCGCC CGACCCGGAC GACCGGCTGC
ACGCCGTACC CCGAGTCCCA GATCGACATC ACCGGCCGCC CGCACGCGGG GGTGACGCCG
TTCCAGGAGG TCCGCGGCTA CGTCGACGGG CACACCCACG GGATGGCCTT CGAGTTCCTC
GGGGGCGACG TCCACTGCGG CAAGCCCTGG GACCGGTACG GCGCGCCGTA CGCCCTGGTC
GACTGCCCCG ACCACACCGC CACCGGCGGG TACGGCGGCG TCCTGGAGTC GGTGCTCTCC
GGGGAGCCGC ACCACGACCC CGTGGGCTGG CCGACCTTCA AGGACTGGCC GGCGCCGAAC
TCGCTGACCC ACGAGGGCAC CTACTACCGC TGGCTGGAGC GGTCCTGGCG GGGCGGCCAG
CGGATCTTCG TCAACCTGCT CGTCGAGAAC AACCAGCTCT GCCAGATCTA CCCGATCAAG
CGGAACTCCT GCGACGACAT GGACTCGGTG CGCCTGCAGG CCAAGGACAT GTACGCCATG
CAGGACTACA TCGACGCCCA GTTCGGCGGT CCCGGCAAGG GCTTCTACCG GATCGTCAAG
AACCCCTACC AGGCCCGCGA GGTCATCAAC GCCGGGAAGA TGGCGGTGGT CATGGGCATC
GAGACCAGCG TCCCGTTCGG CTGCACGTTC AAGGCGCTGC CGGGCGGCGA CGTGCCGGCC
TGCGACGTCG ACGACATCGA CGCCCAGCTC GACCAGGTCA AGCGGATGGG CGTGCGCCAG
ATGGAGCTGG TGAACAAGTT CGACAACGCC CTGTCCGGCA TCGCCGGCGA CAACGGCGAG
GTCGGGGTCG CGGTCAACAG CGCGAACTTC CTCGAGACCG GCACGTTCTG GGACATGGAG
CACTGCGAGC CCGAGATCCC GGGCGCGCAC GACCACAACC AGCTCGCGGC GCCCGACATC
TCGGCGGGAC AGCAGGACGC GCTGTTCGGT GCGATCGGCG AGCTCTTCGG GCCGCTCGCG
CTGCCGGCCC TCCCGGTCTA CCCGCCGCCG GACCACTGCA ACAGCCGCGG CCTGACCACG
CTGGGCGAGT ACACGATCCG CGGCCTCGCG CAGCGGCACA TGATCTTCGA CCCCGACCAC
ATGAGCGTGA GGGCCCGGAC CCGGGCGCTC GACCAGATCG ACGACCTGGG CTACCACGGC
GTCATCTCGA GCCACTCCTG GTCCACGCCC GACGCGTACC CGCGGATCTA CGCCGACGGC
GGGTTCATCA CGCCGTACGC CGGCGACTCG ACCGGCTTCG TCGAGAAGTG GCGCACCCAC
CTCGGCTGGG CCGACCCGCG CTACTACTGG GGGATCGGGT ACGGCGCGGA CATGAACGGG
CTCGGTGCGC AGGGCGACCC GCGCGGCGCG GACGTGCCGA ACCCGGTCAC CTACCCGTTC
ACCGGGCTCG GCGGCGTCCG CGTCGGTCGG CAGCACGCCG GCGAGCGCGT GTACGACATC
AACGTCGACG GCGTCGCTCA GTACGGCCTC TACCCCGACT GGATCCAGGA CCTCGGCCAG
GTCGCCGATG CCCAGCACCC GGGCGACGGC GCGAAGATCG AGGACGACAT GGCGCGCGGG
GCCGAGGCCT ACCTGCAGAT GTGGGAGCGG GCCGAGGGCA TCGCGCCGGA CTCCTGCCGC
AACGCCGACC TGCGCCTGCC GGTCGACCGG GCCCTGGGGC TGGTCCGCCC CGGCATGACC
ACCAAGCAGG TCATGCAGGC CGTCGGCCAG CCCTACACCC GGCTCGGCGC GACGTACGGC
TTCTGCGCGA GGACGTCCGC GGACCCGAAG GTGATGGTGA CCGTGACGTT CTCGGAGGCG
GGCCGGGTCA CCGGCGCGCG CCGGGCCTGA
 
Protein sequence
MREKSPGRRL AWALVAAAVV VATAISMGGA AGLRPGGLGE RYAFAQDALG QRSVDKNAIS 
RAGDLGDLTR DPTTAAEKAA ATAYVDHERA LPDPELTTAP IIGARHRHPQ DRYALAGGCY
TLTPDGDPFY FKATDLGSYL IYDADRAFLL ADGNRAGEPS ADTVWDISRS AGGRYSFRND
TGALTLGSTT AFRPTRTTGC TPYPESQIDI TGRPHAGVTP FQEVRGYVDG HTHGMAFEFL
GGDVHCGKPW DRYGAPYALV DCPDHTATGG YGGVLESVLS GEPHHDPVGW PTFKDWPAPN
SLTHEGTYYR WLERSWRGGQ RIFVNLLVEN NQLCQIYPIK RNSCDDMDSV RLQAKDMYAM
QDYIDAQFGG PGKGFYRIVK NPYQAREVIN AGKMAVVMGI ETSVPFGCTF KALPGGDVPA
CDVDDIDAQL DQVKRMGVRQ MELVNKFDNA LSGIAGDNGE VGVAVNSANF LETGTFWDME
HCEPEIPGAH DHNQLAAPDI SAGQQDALFG AIGELFGPLA LPALPVYPPP DHCNSRGLTT
LGEYTIRGLA QRHMIFDPDH MSVRARTRAL DQIDDLGYHG VISSHSWSTP DAYPRIYADG
GFITPYAGDS TGFVEKWRTH LGWADPRYYW GIGYGADMNG LGAQGDPRGA DVPNPVTYPF
TGLGGVRVGR QHAGERVYDI NVDGVAQYGL YPDWIQDLGQ VADAQHPGDG AKIEDDMARG
AEAYLQMWER AEGIAPDSCR NADLRLPVDR ALGLVRPGMT TKQVMQAVGQ PYTRLGATYG
FCARTSADPK VMVTVTFSEA GRVTGARRA