Gene Noca_3972 details

Gene Information

Locus tag: Noca_3972
Symbol:
ID: 4598107
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 4189469
End bp: 4192450
Gene Length: 2982 bp
Protein Length: 993 aa
Translation table: 11
GC content: 73%
IMG OID: 639778577
Product: PKD domain-containing protein
Protein accession: YP_925156
Protein GI: 119718191
COG category:
COG ID:
TIGRFAM ID:
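
The gene and protein lengths above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (the variable names are illustrative, not part of the record):

```python
# Coordinates copied from the record; assumed to be 1-based and inclusive.
start_bp = 4189469
end_bp = 4192450

# Inclusive span, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

# A CDS should divide evenly into codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not counted in the protein.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 2982 0 993
```

The result agrees with the listed Gene Length (2982 bp) and Protein Length (993 aa).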


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGCCGCGC CCGGTGAGAC GGTCACGTTC ACGGTCTCCA GCGACGCGAG CGGGAACCCG 
ACGTTCGTGT GGAACGTCGA CGGGCTCGAC GTCCAGTCCG ACAGCTCGTC GCTCCAGTGG
GCGTTCCAGG GCGAGGGTCA GCACACGGTC ACGGTGACCG TCGACGACGG GTTCGACTCC
GGCACCGCGT CGACGTCGAT CGAGGTCCGC ACGCCCCAGC CCAACCACCC GCCGTCGGTG
ACCCTCGACG CCGACCACCA GCAGGCCGCC CCCGGCGAGC CGGTCACGTT CACCGCGAGC
TTCGCCGACC CCGACCCCGG TGACTCCGTC GCGCTCGCCT GGTACGTCGA CGGCGACCGG
GCCGCCGAGA ACGGGCTCAC GTTGACCCGT TCGTTCGCGG CGACCGGCAC GCACGGCGTC
CGGGTCGTCG CGACCGATCT GTCGGGGGCC TCGAGCGAGG CGTCGATCCA GGTGAACGTG
GTGGGCACGG CGCCGGAGGC GGCGATCGCC GTACGCACGC CGTCGCCGAA GACCCGCAAG
GTGACCGTCC TCGACGCGTC CGGCTCGGTC CCGGCGTCGC CGTCGGGCTC GATCGTCTCC
TACCACTGGG ACCTCAACGG CAACGGCACC TTCGAGACCA CCTGCGCCGG ACCGGTCGTC
GGCGTGATCA GCGCCGCGGC GGGCGACCAC CCGGTCTCGG TGCTGGTGAC CGACGACAGC
GGCGGGACGT CGACGCCGGT CACGACGGTC CTCTCGCTCG CCACCTCGCG GCTCGACCAG
GAGTACGCCC CCGGCGCGCT CGCCGTCACC GCGGGCGGCT GCGCGGGACC GGCCAAGGAC
GGGGTGGTGC CCGAGGGGTA CCCCGCCGAC GACCTCACCT GCTACACGAC GGTCCGCGCC
GGCATCGCCG AGGCGATGTC GACCTGCTTC CGCCGGCGTA CGGCCCCCGT CGGTGACCGG
CTGCTCGTGA AGGAGCTCTA CGCCAGCACC AGGACCGTCC GGCTCAACGG CATCGACGTC
CGGCCCAGCT CCGGCGTGGC CATCGAGATC GACACCTGGA CCGCGGAGGT CAAGACGATC
GGCGGCAAAG CGAAGGCCTC GGTCGACGCC GGCAGCACGC TGGGCAAGCT GACGTTCTAC
TACGGCAGCA TCAGCTGGGA CCTGCCCAGC GGCAAGGCGT CGCGCTTCAA CCTCGGCAGC
CTGTCGATCA CCAAGGGCGC CGAGCTGTTC GGGCTCTCGG TCGAGGGCGA CGCCCGCCTC
GACCTCGTCT ACCGCGGCGC CGAGGTGCCC GTGACGATCC ACCTGCCCGC ACCGCTGGAC
GTGTCGGCGA CCGTCACCCT CAAGACCGAC AACCTCAAGG GGCTGCGCCT CGAGGAGGTG
CACCTGCGCG TCAAGAACGC GACCTTCGGG GCGTTCACGG TCAACGACCT CGACCTGCTC
TACAACGCGG CGGCCTACCA GTTCGACGGC TTCGCGGACC TCTCCCTGAC GTCCGTCGGC
AACCTGCAGG TGAGCATCCA GGTCGTCGGC AGGACCGTCA CCATGTTCGC CGCGAACTTC
ACGCCCGTGC CCCCGCTGGC GCTCGGGTCG GGGGTGTTCC TCCAGCACAT CGACTTCGGG
TACGACGCCG GCCCGCCGCT GACCCTCAAC GGCGGCGTGA AGCTCACCGC CGGACCGCCG
ATCAACGGCA CCGCGGCCGC CGCCATCGAG GGGACCTTGA AGTTCGTGGC GTCCGACCCG
TGGCTGCTGC GCGCCGACGG CAACGCGTCG ATCGCGGGCT TCGGCGTGGC CAGCGCCTAC
CTGCAGTACC AGTCGAACGG CATGATCCGG CTCGGCGGGC GGATCGACGC CCGGCTCTAC
GACATCGTCA GCGCCAAGGC CAACATCGAC ATGTGGCTCT ACGCGCCCAC GATGAAGTTC
AACGCCCAGG CCCGCGGCGA CGTCTGCGTC TGGAAGGGCT GTGGGGGCGG CGCGCTGGTG
GTGTCCAGCA CCGGCGTCGG CGGCTGCGTC TACACGTTCT TCGCGGACTT CGGCCTCGGG
TACCGGTGGG ACGGGGAGTT CAAGTACTAC CTGACCGGCT GCGACATCAA CCACTGGGCG
TCCGCGTGGG ACGGCGGCGC CCGCAACCGG CTGGCGTCGT ACGCCAGGAC CGCCGCGGTC
AGCTCCGAGG ACATGGTCGT CGCCCGCGGC GAGCGGGCCG TCTACTTCCG GTTCGCCTCC
GTCGACGACC CGCCGCAGGT CACCGTGACC GGACCCGACG GCACCGAGAT CGCGGTGCCC
GCCGGCACCG AGAACTTCGA GGCGACCGAC GACCACATCG TCCTGCAGGT GCCGCCCGAG
CACGCGACGT ACGTGATCGT CCGCGACCCC GCGCCCGGAC GCTGGCAGGT GCGCACCGCG
GACGGGACGC CCGACCTGGT GGCGGTCGGG CAGGCCTCGG CGCTGCCGAA GCCGCGGGTG
CGGGCGAGCG TCGGCGGTCG TGGGCACGCC CGGACCCTGG ACTACCGGCT GAACGCGGTC
GAGGGGCAGA CCGTGCAGTT CTTCGAGCGC GCCGGGCGCA CCACCCAGCG GCTGGGCACC
GTCTCCGACG CGCAGGGCCG GATCCGGTTC AGCCCGGCGC CGGGGCCGGC GGGCGAGCGG
TCGATCATCG CGGTGGTCCA GCACGACGGC GCACCGCGCG AGCAGGTCAC CGTGGCCAGC
TACCGGGCTC CCGGGCCGCT GCGCCCGGCC GCGACGCGTC GGATCGCCGT CAAGCGGACC
GCGACGCGGG CCGTCGTCAC CTGGGCGGAC GCGCGGCGCG CGGCGAGCTG GCGGGTGGTC
GCCACCGCCG ACGACGGGCG CCGGTGGTCG GTGCGGCTGG ACCGCCGCAC GCTGGTCCTG
CCCAGCGTGT TCCGCGGCAG GACCGTGACC GTGACGGTGC GCGGGGTCAG CGCGGACCAG
GTCGCCGGGC CGGCGAAGCG ACAGGTCTCA CGCGGTCGGT GA
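
The GC content field can be checked against the gene sequence. A minimal sketch, assuming the sequence is pasted as plain text in the space-separated 10-base blocks shown above (`gc_percent` is a hypothetical helper name, not part of the record):

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    # Drop the spaces and line breaks used to group the sequence into blocks.
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)

# First row of the gene sequence, used here only as a short demonstration;
# the value in the record refers to the full 2982 bp sequence.
first_row = "GTGGCCGCGC CCGGTGAGAC GGTCACGTTC ACGGTCTCCA GCGACGCGAG CGGGAACCCG"
print(round(gc_percent(first_row), 1))
```

Passing the whole gene sequence instead of `first_row` gives the figure to compare with the GC content field.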
 
Protein sequence
MAAPGETVTF TVSSDASGNP TFVWNVDGLD VQSDSSSLQW AFQGEGQHTV TVTVDDGFDS 
GTASTSIEVR TPQPNHPPSV TLDADHQQAA PGEPVTFTAS FADPDPGDSV ALAWYVDGDR
AAENGLTLTR SFAATGTHGV RVVATDLSGA SSEASIQVNV VGTAPEAAIA VRTPSPKTRK
VTVLDASGSV PASPSGSIVS YHWDLNGNGT FETTCAGPVV GVISAAAGDH PVSVLVTDDS
GGTSTPVTTV LSLATSRLDQ EYAPGALAVT AGGCAGPAKD GVVPEGYPAD DLTCYTTVRA
GIAEAMSTCF RRRTAPVGDR LLVKELYAST RTVRLNGIDV RPSSGVAIEI DTWTAEVKTI
GGKAKASVDA GSTLGKLTFY YGSISWDLPS GKASRFNLGS LSITKGAELF GLSVEGDARL
DLVYRGAEVP VTIHLPAPLD VSATVTLKTD NLKGLRLEEV HLRVKNATFG AFTVNDLDLL
YNAAAYQFDG FADLSLTSVG NLQVSIQVVG RTVTMFAANF TPVPPLALGS GVFLQHIDFG
YDAGPPLTLN GGVKLTAGPP INGTAAAAIE GTLKFVASDP WLLRADGNAS IAGFGVASAY
LQYQSNGMIR LGGRIDARLY DIVSAKANID MWLYAPTMKF NAQARGDVCV WKGCGGGALV
VSSTGVGGCV YTFFADFGLG YRWDGEFKYY LTGCDINHWA SAWDGGARNR LASYARTAAV
SSEDMVVARG ERAVYFRFAS VDDPPQVTVT GPDGTEIAVP AGTENFEATD DHIVLQVPPE
HATYVIVRDP APGRWQVRTA DGTPDLVAVG QASALPKPRV RASVGGRGHA RTLDYRLNAV
EGQTVQFFER AGRTTQRLGT VSDAQGRIRF SPAPGPAGER SIIAVVQHDG APREQVTVAS
YRAPGPLRPA ATRRIAVKRT ATRAVVTWAD ARRAASWRVV ATADDGRRWS VRLDRRTLVL
PSVFRGRTVT VTVRGVSADQ VAGPAKRQVS RGR
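
The gene sequence starts with GTG, yet the protein starts with M. Under translation table 11, GTG is one of the alternative initiation codons and is read as methionine at the first position. A minimal sketch of that translation, assuming the standard amino acid assignments (which table 11 shares) and the start codon set listed for table 11; `translate_cds` is a hypothetical helper, not the tool that produced this record:

```python
# Codon table in the conventional TCAG ordering; table 11 uses the same
# amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Initiation codons for table 11; at position 1 each is read as Met.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove block spacing and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codon read as Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends the protein
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = "GTGGCCGCGC CCGGTGAGAC GGTCACGTTC"
print(translate_cds(first_blocks))  # MAAPGETVTF
```

The output matches the first block of the protein sequence. The sketch does not handle ambiguous bases such as N, and it ignores any trailing partial codon.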