Gene Noca_0131 details

Gene Information

Locus tag: Noca_0131
Symbol:
ID: 4597580
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 143575
End bp: 145524
Gene Length: 1950 bp
Protein Length: 649 aa
Translation table: 11
GC content: 69%
IMG OID: 639774741
Product: endothelin-converting protein 1
Protein accession: YP_921363
Protein GI: 119714398
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG3590] Predicted metalloendopeptidase
TIGRFAM ID:
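
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but the numbers agree under that reading) and that the CDS ends in a stop codon that is not counted in the protein length. All variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 143575
end_bp = 145524
gene_length_bp = 1950
protein_length_aa = 649

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the final codon is taken to be the stop.
codons, remainder = divmod(gene_length_bp, 3)

print(span == gene_length_bp)           # coordinates agree with the stated length
print(remainder == 0)                   # length is a whole number of codons
print(codons - 1 == protein_length_aa)  # codons minus the stop equal the protein length
```

All three checks print True for this record.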


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.029376
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGCATCC TCGACGAAGC CCGGGCAGGC ATGAGCCCCG AGATCCGGCC CCAGGACGAC 
CTCTTCGGCC ACGTGAACGG CCGCTGGCTG GACGAGACCG AGATCCCGGC GGACCGCTCG
AGCTGGGGGC CCTTCATCCA GCTCGCCGAC ACCGCCGAGA CCCAGGTCCA CGAGATCATC
GAGGACCTCG CGCGCCGGGT CGCGGCCGGC GAGCCGGTCG ACGAGGACGC CCACAAGATC
GGCGACCTGT TCGCCTCGTT CATGGACACC GAGACGATCG CGCGCAACGG CCTGCGGCCG
GTGCGCCCCC TGATCGAGGC CGTCGCGGGG CTGCGCGACG TCCGCGACCT CGCTGCGTTC
CTCGGCGAGT TCGAGCGGAT CGGCGGCCAC GGCCTGTTCG GCTCCTACGT CGACACCGAC
TCCAAGAACT CCGACCGCTA CCTGTTCAAC CTGGTGCAGG GCGGGCTCGG CCTGCCCGAC
GAGTCCTACT ACCGCGACGA GAAGTTCGCG GAGATCCGCG AGAAGTACGT CGCCTACCTC
ACCACCCTGT TCGGCCTGGG GGAGCACCCC GATCCTGCTG CCGCGGCCGC GACGGTGCTC
GCCATCGACA CCCGGATGGC CGCGGGGCAC TGGGAGCGCG CCGAGACCCG CGACGTGCAG
AAGACCTACA ACCTGATGAC CAGGGCCGAG CTGATCGAGC TCAGCCCGGG CTTCGACTGG
GACGCCTACG TCACCAACCT CGGCGGCAAC GAGGAGACGC TCGCGGAGGT GTGCGTGCGG
CAGCCGTCGT ACTTCACCCA CCTCTCGGTG CTCCTCGACG AGATCTCCCT CGAGGACTGG
CGCGAGTGGC TGCTGGCGCA CGTGCTGCGG TCGGCGGCGG CGTACCTCAC CGACGACTTC
GTCGAGACGA ACTTCGACTT CTACGGCCGG ACCCTCAGCG GCACGCCCGA GCTGCGGGCG
CGGTGGAAGC GGGGGGTCGC GCTGGTCGAG GGCGCGATCG GCGAGGCGGT CGGCAAGGAG
TACGTCGCAC GGCACTTCCC GCCCCGGTCG AAGGCGATGA TGGACGAGCT GGTCGCGAAC
CTGCTCGCCG CCTACCGCCA GTCCATCTCC CGGCTCGACT GGATGACCGA GGAGACCAAG
CAGCGCGCGT ACGACAAGCT CGACAGGTTC CGGCCCAAGA TCGGCTACCC GGAGAAGTTC
CGCGACTACT CCGCGCTCCG GGTGACCCGC GACGACCTGC TCGGCAACGT CGCCGCCGCG
TCGGCGTTCG AGACCGACCG GCAGCTCGCG AAGATCGGCT CGCCGGTGGA CCGCGACGAG
TGGTTCATGC TCCCCCAGAC CGTCAACGCC TACTACAACC CCGGCACCAA CGAGATCTGC
TTCCCCGCCG GCATCCTGCA GAAGCCGTTC TTCTCCCCGG ACGCCGAGGA GGCCGAGAAC
TACGGCGGCA TCGGCGCGGT CATCGGCCAC GAGATCGGGC ACGGCTTCGA CGACCAGGGC
GCGCAGTACG ACGGCAGCGG CAACCTGCAC GACTGGTGGA CCCCCGACGA CAAGGCCGCG
TTCGAGGTGA AGTCGAAGGC CCTCATCGAG CAGTACGACG GCTTCGAGCC CCGCACGCTG
CCCGGCGAGC GCGTCAACGG CGCGCTCACC GTCGGTGAGA ACATCGGCGA CCTCGGCGGG
CTGACCATCG GCCACACCGC CTACCTGATC GCCCGCGGCG GGAGCGCGTC CGTCGAGGAC
CGGCAGAAGG TGTTCCTGAA CTGGGCCTAC TGCTGGCGGA CCAAGCGGCG CAAGGAGCAG
GAGCAGCAGT ACCTCACCAT CGACCCGCAC TCCCCGGCGG AGTTCCGTGC GAACATCGTG
CGCAACCTCG ACGAGTTCCA CGAGGTGTTC GGCACCGTCG AGGGGGACGG GCTCTGGCTG
GACCCCGACC AGCGGGTGCG CATCTGGTGA
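
The GC content field can be recomputed from the gene sequence as the share of G and C bases. The helper below is a minimal sketch, not the database's own method; the function name `gc_content` is hypothetical, and the record does not state how the database rounds the value. The sequence is printed in blocks of ten separated by spaces, so whitespace is stripped first.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    bases = "".join(seq.split()).upper()  # drop spaces and line breaks
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


# First ten bases of the gene sequence above: 3 G + 3 C out of 10.
first_block = "GTGAGCATCC"
print(gc_content(first_block))  # 0.6
```

Passing the full 1950 bp sequence to the same function gives the gene-wide fraction to compare against the reported percentage.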
 
Protein sequence
MSILDEARAG MSPEIRPQDD LFGHVNGRWL DETEIPADRS SWGPFIQLAD TAETQVHEII 
EDLARRVAAG EPVDEDAHKI GDLFASFMDT ETIARNGLRP VRPLIEAVAG LRDVRDLAAF
LGEFERIGGH GLFGSYVDTD SKNSDRYLFN LVQGGLGLPD ESYYRDEKFA EIREKYVAYL
TTLFGLGEHP DPAAAAATVL AIDTRMAAGH WERAETRDVQ KTYNLMTRAE LIELSPGFDW
DAYVTNLGGN EETLAEVCVR QPSYFTHLSV LLDEISLEDW REWLLAHVLR SAAAYLTDDF
VETNFDFYGR TLSGTPELRA RWKRGVALVE GAIGEAVGKE YVARHFPPRS KAMMDELVAN
LLAAYRQSIS RLDWMTEETK QRAYDKLDRF RPKIGYPEKF RDYSALRVTR DDLLGNVAAA
SAFETDRQLA KIGSPVDRDE WFMLPQTVNA YYNPGTNEIC FPAGILQKPF FSPDAEEAEN
YGGIGAVIGH EIGHGFDDQG AQYDGSGNLH DWWTPDDKAA FEVKSKALIE QYDGFEPRTL
PGERVNGALT VGENIGDLGG LTIGHTAYLI ARGGSASVED RQKVFLNWAY CWRTKRRKEQ
EQQYLTIDPH SPAEFRANIV RNLDEFHEVF GTVEGDGLWL DPDQRVRIW
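
The protein sequence follows from the gene sequence under translation table 11. In that table the codon assignments match the standard code, but several alternative start codons, including the GTG that opens this gene, are read as methionine at the initiation position. The sketch below assumes that behaviour; `translate_cds` and the constants are illustrative names, and the start-codon set is the one commonly listed for table 11.

```python
from itertools import product

# Standard codon assignments in T, C, A, G order (shared by table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: start codons recognised under translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading an initial start codon as M and stopping at '*'."""
    bases = "".join(seq.split()).upper()  # drop spaces and line breaks
    usable = len(bases) - len(bases) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = bases[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAGCATCC TCGACGAAGC CCGGGCAGGC"))  # MSILDEARAG
```

The output matches the first block of the protein sequence, with the leading GTG read as M rather than V.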