Gene Noca_1530 details


Gene Information

Locus tag: Noca_1530
Symbol:
ID: 4595681
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 1622315
End bp: 1625329
Gene Length: 3015 bp
Protein Length: 1004 aa
Translation table: 11
GC content: 69%
IMG OID: 639776128
Product: hypothetical protein
Protein accession: YP_922731
Protein GI: 119715766
COG category: [S] Function unknown
COG ID: [COG1615] Uncharacterized conserved protein
TIGRFAM ID:
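
The gene and protein lengths in the record above follow from the start and end coordinates. The short Python sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the CDS ends with a single untranslated stop codon; the record does not state either assumption, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1622315
end_bp = 1625329

# Assumption: coordinates are 1-based and inclusive,
# so both end points count toward the length.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values match the Gene Length and Protein Length fields of the record.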


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGAGC TGTTCGACGA GGCCCCTCGG GACCCCGGAC CCCCGGCGCG GTCCGGTTCG 
CGCCGGTCCC GTGCCCTGAT CGTCACGGCC GTGGTGCTGG TGATCGGCTT CCTGGGGCTG
AGCACGTTCG CGGGCATCTA CACCGACCGG CTCTGGTACG TCTCGGGCGG GTACGGCGCG
GTCTTCACCA CGCTGTTCTG GACCAAGACC GTGCTGTTCT TCCTCTTCGG CGCGGGCATG
GCGCTGGTGG TCGGCGTGAA CATCTACCTG GCCTACCGGT TCCGGCCGTT CTTCCGCCCG
AACTCACCGG AGCAGAACGG GCTGGACCGC TACCGCGAGG CCATCAACCC GATCCGGACC
TGGCTGCTGG TGGGCGTCGC GCTCGTGCTC GGCGCGTTCG CCGGCAGCTC GGCGATCGGC
GAGTGGCGCG ACTACCTGCT GTGGCGCAAC GGCACGTCGT TCGGCAGCGA GGACGCCTAC
TTCCAGAAGG ACATCGGCTT CTACGTCTTC GACCTCCCGT GGCTGCACTA CCTGGTCGAC
TACGCGATGG CCGTCCTGGT CGTCGCGCTG ATCGCCGCCG CGGTCGTGCA CTACCTGTAC
GGCGGGATCC GGCTGCAGAC GCCCCGCGAC CGGCTCTCCG GGGCCGCCCA GGCGCAGATC
TCGGTGCTGC TCGGGTTCTT CGTGCTCGCG AAGGCCGCCG ACTACTGGCT GGACCGCTTC
GACCTGGTCA GCCAGGGCGG CGGCGTGATC ACCGGCATGA CCTACACCGA CGACCACGCG
GTGCTGCCGG CCAAGAACAT CCTCCTCGGC ATCTCGATCA TCTGCGCCGT GCTCTTCTTC
GTGAACGTGT GGCGCCGCAC CTGGCTGCTG CCCTCGGTCG GCCTGGCCCT GCTCGCCGTC
TCGGCGATCC TGCTGGGGCT GATCTGGCCG GGCATCGTGC AGCAGTTCCA GGTCAAGCCC
TCCGAGGCGG ACAAGGAGGC GCCGTACATC GAGAAGAACA TCGAGGCGAC CCGCACCGCC
TACGACGTCG CCAACGTCGA CGTGGAGAAG TACGACCCGG CGACCGCGCT CGGCGCCGGC
TCCGCGAGCA TGGTCGAGGA GGAGACCTCC TCGGTGCCGC TGGTCGACCC GCAGCTGGTC
CGGGACGCCT TCGAGCAGAA CCAGCAGGTG CGGGCCTACT ACTCGGTCGC CCAGGTCCTC
GACGTGGACC GCTACGACAT CGACGGCAAC GACCGGGCGC TCGTGCTCGG GGTCCGGGAG
CTCGACCAGA GCGGCATGAA CGCCGGCGAC CGCAACTGGA CCAACCTGCA CACCGTCTAC
ACCCACGGCA ACGGCATCAT CGCGGCGTTC GCCAACCAGC GCAGCGAGGA CAACAAGACC
CAGATCGACA ACGCGGACAA CACCGGTGAC CAGGCCGGCA TCGTGTGGGC CCAGGGCACC
AACGCCGGGC AGGACGCCCT CGCTCGTGCC ACCGGCGGCT TCGAGGACCG GATCTACTAC
GGAGAGCAGA GCCCGCAGTA CTCCGTGGTC GGCAAGGCGA CGCCGGACTC CACCGACGTC
GAGCTGAACC TGCAGACGGC CGGGTCCGAC GAGGGCTCGA CGACGACGTA CGACGGCAAC
GGCGACGCCA GCGTCGGCGG GTTCTTCAAC CAGCTGATGT TCGCCACCAA GTTCGGCGAG
CCGAACTTCC TGCTCTCGGG GCGCGTGAAC CCCAACAGCA AGGTGCTGTT CAACCGCAAC
CCGGCCGACC GGGTCGAGAA GGTGGCGCCC TGGCTGACCG TGGACAGCGA CCCCTACCCG
GCCGTGGTCG ACGGCCGGAT CCTGTGGATC ATCGACGGCT ACACCACCAC CGACCGCTAC
CCGCTGTCGG AGAAGGAGTC GTTCCAGACG ATGATCGACG ACTCCCTGCA GGAGGAGACC
GGGCTGCGCA CCCTGCCGAC CGACGAGATC AACTACATGC GCAACGCCGT GAAGGCCACC
GTCGACGCCT ACACCGGTGA CGTCACGCTC TACGCCTGGG ACGAGGAGGA CCCGATCCTG
CAGGCCTGGC GCAGCGCGTT CCCCGGCACC GTCGAGGACA AGTCGGAGAT CTCCGACGAT
CTGCTCGACC ACCTGCGCTA CCCCGAGGAC CTGTTCCAGG TGCAGCGCTA CCAGTTCGCC
CGCTACCACG TGACCGAGCC GATCGACTTC TACCAGGGCA ACAACCGCTG GCAGGTGCCC
GAGGATCCCT ATTCAAAGGG CAAGTTCCAG CCGCCCTACC GGCTCTTCGT CGACAGCAAC
GGCGGCACCG ACCAGGTGTT CGCACTGACC TCGGTCTACG TCCCCTACAA CAAGAACAAC
CTCGCGTCGT TCGTCTCGGT GAACGCGGAT GCGACCAGCG ACCAGTACGG CCAGATGCAG
GTGCTCGAGC TGCCCAACGA GCAGACGCCG GGCCCCGGCC AGGTCGCCAA CCAGTTCGCC
ACCGACCCGG AGGTCGCCAA CGAGCTGGCC CAGTTCAACC GCAGTGGCGC GCGGCCGGTG
TACGGCAACC TGCTGACGCT GCCGATCAAC GACGGGCTGA TGTACGTCCA GCCGGTGTAT
GCGACCCAGG CCCTCTCGGA CTCGAGCTTC CCGATCCTGC GCTACGTGCT GGTGAAGTAC
GGCAACGACA TCGGCTTCGG CTCGACGCTG CGCGACGCCC TGGAGAACCT CCTCGGCGTC
AGCACCGGCC CCGGCACCCA GCCCCCGGAC ACCGGCCAGC CCGGCGACAA CGAGAACCCG
CCGCCCGCCA CCGGCACCGT CGCCGCGCAG ATCCGCGCCC TCCTCGCCCA GGCCCAGGAC
GCCTTCGACG CCGCCGACGC GGCGCTGGCC GACGGGAACC TCGCCGAGTA CCAGCGCCAG
ATCGGCATCG CCCAGGCCAA CGTCGAGGCC GCCATGGAGC TCGGCCAGAA GCGCGGCTCG
GCCGGTCAGC CGTCGGGCTC GCCCTCGGGG TCCGCGTCGT CCTCCCCCTC GGAGTCGCCG
AGCCCGTCCT CCTGA
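
The gene sequence is printed in space-separated blocks of ten bases, so the spacing has to be stripped before any computation such as GC content. The sketch below is one possible way to do that in Python. The function names `clean_sequence` and `gc_fraction` are hypothetical helpers, not part of any database tool, and the example uses only the first line of the sequence rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAGCGAGC TGTTCGACGA GGCCCCTCGG GACCCCGGAC CCCCGGCGCG GTCCGGTTCG"

seq = clean_sequence(first_line)
print(len(seq), gc_fraction(seq))
```

Passing the whole block-formatted gene sequence to `clean_sequence` instead of one line would give a value comparable to the GC content field in the record, assuming that field is a simple G+C fraction over the full gene.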
 
Protein sequence
MSELFDEAPR DPGPPARSGS RRSRALIVTA VVLVIGFLGL STFAGIYTDR LWYVSGGYGA 
VFTTLFWTKT VLFFLFGAGM ALVVGVNIYL AYRFRPFFRP NSPEQNGLDR YREAINPIRT
WLLVGVALVL GAFAGSSAIG EWRDYLLWRN GTSFGSEDAY FQKDIGFYVF DLPWLHYLVD
YAMAVLVVAL IAAAVVHYLY GGIRLQTPRD RLSGAAQAQI SVLLGFFVLA KAADYWLDRF
DLVSQGGGVI TGMTYTDDHA VLPAKNILLG ISIICAVLFF VNVWRRTWLL PSVGLALLAV
SAILLGLIWP GIVQQFQVKP SEADKEAPYI EKNIEATRTA YDVANVDVEK YDPATALGAG
SASMVEEETS SVPLVDPQLV RDAFEQNQQV RAYYSVAQVL DVDRYDIDGN DRALVLGVRE
LDQSGMNAGD RNWTNLHTVY THGNGIIAAF ANQRSEDNKT QIDNADNTGD QAGIVWAQGT
NAGQDALARA TGGFEDRIYY GEQSPQYSVV GKATPDSTDV ELNLQTAGSD EGSTTTYDGN
GDASVGGFFN QLMFATKFGE PNFLLSGRVN PNSKVLFNRN PADRVEKVAP WLTVDSDPYP
AVVDGRILWI IDGYTTTDRY PLSEKESFQT MIDDSLQEET GLRTLPTDEI NYMRNAVKAT
VDAYTGDVTL YAWDEEDPIL QAWRSAFPGT VEDKSEISDD LLDHLRYPED LFQVQRYQFA
RYHVTEPIDF YQGNNRWQVP EDPYSKGKFQ PPYRLFVDSN GGTDQVFALT SVYVPYNKNN
LASFVSVNAD ATSDQYGQMQ VLELPNEQTP GPGQVANQFA TDPEVANELA QFNRSGARPV
YGNLLTLPIN DGLMYVQPVY ATQALSDSSF PILRYVLVKY GNDIGFGSTL RDALENLLGV
STGPGTQPPD TGQPGDNENP PPATGTVAAQ IRALLAQAQD AFDAADAALA DGNLAEYQRQ
IGIAQANVEA AMELGQKRGS AGQPSGSPSG SASSSPSESP SPSS