Gene Noca_1037 details


Gene Information

Locus tag: Noca_1037
Symbol:
ID: 4599698
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 1091016
End bp: 1093673
Gene Length: 2658 bp
Protein Length: 885 aa
Translation table: 11
GC content: 61%
IMG OID: 639775636
Product: pentapeptide repeat-containing protein
Protein accession: YP_922243
Protein GI: 119715278
COG category:
COG ID:
TIGRFAM ID:
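
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1091016
end_bp = 1093673
protein_length_aa = 885

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and its last codon is a stop codon,
# which does not contribute an amino acid.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print(gene_length_bp)                            # 2658
print(gene_length_bp % 3)                        # 0
print(expected_protein_aa == protein_length_aa)  # True
```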


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTGGAC GACCAAGTAT GGCCTCCAGT TTGAGCAAGT GGCGACAAGC ATTCGAGAAA 
CTCGATCAGA CGCCCTGGCC GGGCACGAAT CGAATCGACG ATCCGACCCA GTTGCGTGGC
CGCAAGCGCG ACGTTTCCGA TATCGCTGTG GCCTGCCTAG CAAATGACCT CCTTGTCATC
CACGGAGCGT CCGGGGTCGG AAAGTCCTCG CTCCTCACAG CTGGGCTCAT TCCTGAGTTG
AGGCGACGAC GGAAGACCGT CGTGTACTGC AATCGCTGGG ATGCTCCTGA TGACAGCGTG
GCTCCGTCTG CGCACATCAC TGAAGGTGTG CTGGAGGCGG ACCCGTTCTC ACTCCCGAAC
GGCGAGTTTG AATCTAGATT CGGTGATCGA CTGGTGATCG TACTGGACCA GTTCGAAGAA
GTGATTCGAA ACAATCCCGA GTTCGCGCAA CGCGTGCTTC GCTGGATCGA AGACGTAGTC
GGCACGACCT CGGCCAGATT CGTCGTTTCG CTTCGGTCTG AGCAGGAGCA CGAGCTCGCC
GGGCTGTATA CCAAGCCGTT CGCGCGTCGC GGACGCGTCG AGATCCCGGC AATTACGAAC
CCAAGGATCA TCGAACTCAT CATCGGGGGC CCCCGTGACT CGTCCACGGA GCAGAGCACC
GAAGGACGCC GACTTCCGAT CGCGGACGAC GCGATCGATG CTCTTCGAGA GGCCTGGCAA
GCATCACAGG CGGATGAGAA CTCAACGAAG TGGGATCGGC CCGGTCTCCT GCACCTCCAA
GCCGCTCTCT ATGTCCTCTG GATGCGTAGA TCTGCAAAGT CCGGCGGCGA CGATGGCCTT
GGGCACATCG CGCTCACCGA TGTCACGGGC CTCATCCGCG AAGTAGCCAA GAGGCATGGT
CTCGGCCGCG CGTCTGCACA GGCGGCACTG CTCGCATATG CGCTTGAGCT GAGCGTGTCG
TGGAAGGTCG AGAACTGCGA GTCGGCGTGC ATGAGCACGC GGACCTGGCA GGGGGTTCCG
GCTGCAATCG TCAATCAGAC CAAATGGATC TTCCGCGACA TCACCGAGCA TCTGTCGAGC
GGCGGTTACA AGACTCCGCG AGACATGTGG GAACTCTCGA GAGAGGTCAT CGGGCACCTT
CACCGGCCTG CGCATGCTGC CGTTCAGCCG GTCGCTCAGA AGCTCTACAA CGGACTCGAT
CCCGAATGGC TTAGTGCACG CGAGGTCGTG CAGACGGAGG CATCGTCGGG CGAAGGAAAT
CAGTACGGCC TGCAGCCAGA CTGGCTTGGC GCTGAACGCC CATCCTTCAG CGCCGACATG
CGCGACGGCG GACCGTCGCC GCGCCAACTT GGTTCTGGGC CCGCGGCCGG CCTCACAAAC
ACCGATGTGA TGTTCGAACT CTTCCGGTGC TATTTCTTCG CGCTCGAATG GCTGAAGCAC
GCGAAGATCG CACAGTTAGA AAGCAAGCGC GACAGCAAGA TCGTAATTCT GACGCACGAT
CGCTACTCGG CCGGACTTGC CCGCTGGCAC GGAAGCCAGG TCGGCAGCTT CAAGGAAGCC
GTGGAACGTC TTGCCTCCCA CCGCGGCGAG GATCTGGCAT GGCACGATGT GGGCGGAAAG
GTGCGTTCTG CAGCCGCGAC CCGACTAGTT GTGAACGCCA ACTGGCGCTC GTGCGCCATC
CACGACACAG ACTTCTCGGG CGTGACTTTC GTGAACTGCG ACTTTGCAGG TTCAACATTC
GAGAGTTGCG TCTTCGACGG AGCGACATTC GTCAACTGCA TCCTCGACCA GGTCGATTTC
GTGCGATGCG CCATCAAGGG GCGCCCGACC TGGCCCGAGA AAGCAGTGCT CGACCGGCTT
GCCGACGAGG CTGTCGCGAA GGCGCCCGAG TTCAGACTGG CTGCCCCGAG CGAGCTGACT
GACGCGCTTC GCGCGCTGCA GGCTCCTAGT TCGACGCGCA TAACCTCAAC GACTCACTTC
CACCTGTATG CAAGGGAGTC TGGCACCCCC GCAGTCACTG CGTCGGGAAG GGCGCCAGCG
CCAAAACCCA CCAGGGAGCC GACGCTTCCG CTCAAGCCCG GCGGTCTCAC TGTCTGCGGC
GGCCGGCTAA GTTCGCTCAC GTTTCGCACA TGCGACTTCC TTGGACCCAA CGCGACGGTC
AGCCTCCACC ACATCGCGGG GACTTCTTTG GAGATTTGCG AACAGCGCGT TGGCACGTTC
GACATCTTCG CAGCGGGTAT CCGAGGCCTA ACGGTCACCC GCCCCGTGGA GGACCTCGAT
GAAGCCGCAC CTAGTGGCGC CTCCTCCAAC CGCGGCGGAC CACGACAGTT CACGCTCAAT
GTTCATCGTG CCCGGGTAAT CAATGCCTGG TTCGGTGTAA ACCTCAAGGG CAAGGCCTCA
TTCGATGACT GCCTCATTCT CCAACTCGTC AACGCAAGCG AGTCCTTTAC GCCAACGCTC
TTGCGATCAA GATACTTCGG CTTGGTCAAC GCTGAGACTC CCAAGGATTT GCTGGTGGAA
CCCAAGGGCT CAATCGAGAT CGCGGATGCG GGTATCGAGA CTCTGGGTGG TTTGGCGCCT
GAGCTGGTGG GCCTGAGCAG AAACATCGAC TTCCGGGAGT CAGTGCCCGA CCTCGCCGTG
GACTCGAAGG AGGAGTGA
 
Protein sequence
MTGRPSMASS LSKWRQAFEK LDQTPWPGTN RIDDPTQLRG RKRDVSDIAV ACLANDLLVI 
HGASGVGKSS LLTAGLIPEL RRRRKTVVYC NRWDAPDDSV APSAHITEGV LEADPFSLPN
GEFESRFGDR LVIVLDQFEE VIRNNPEFAQ RVLRWIEDVV GTTSARFVVS LRSEQEHELA
GLYTKPFARR GRVEIPAITN PRIIELIIGG PRDSSTEQST EGRRLPIADD AIDALREAWQ
ASQADENSTK WDRPGLLHLQ AALYVLWMRR SAKSGGDDGL GHIALTDVTG LIREVAKRHG
LGRASAQAAL LAYALELSVS WKVENCESAC MSTRTWQGVP AAIVNQTKWI FRDITEHLSS
GGYKTPRDMW ELSREVIGHL HRPAHAAVQP VAQKLYNGLD PEWLSAREVV QTEASSGEGN
QYGLQPDWLG AERPSFSADM RDGGPSPRQL GSGPAAGLTN TDVMFELFRC YFFALEWLKH
AKIAQLESKR DSKIVILTHD RYSAGLARWH GSQVGSFKEA VERLASHRGE DLAWHDVGGK
VRSAAATRLV VNANWRSCAI HDTDFSGVTF VNCDFAGSTF ESCVFDGATF VNCILDQVDF
VRCAIKGRPT WPEKAVLDRL ADEAVAKAPE FRLAAPSELT DALRALQAPS STRITSTTHF
HLYARESGTP AVTASGRAPA PKPTREPTLP LKPGGLTVCG GRLSSLTFRT CDFLGPNATV
SLHHIAGTSL EICEQRVGTF DIFAAGIRGL TVTRPVEDLD EAAPSGASSN RGGPRQFTLN
VHRARVINAW FGVNLKGKAS FDDCLILQLV NASESFTPTL LRSRYFGLVN AETPKDLLVE
PKGSIEIADA GIETLGGLAP ELVGLSRNID FRESVPDLAV DSKEE
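
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below, which is not part of the original record, shows one way to strip that layout and translate a stretch of the gene sequence so it can be compared with the protein sequence. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares for sense and stop codons (table 11 differs in the start codons it permits). The helper names `clean` and `translate` are hypothetical, and only the first and last lines of the gene sequence are used as input.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(blocks: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(blocks.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon.

    Raises KeyError on any codon containing a character other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence in this record.
first_line = "ATGACTGGAC GACCAAGTAT GGCCTCCAGT TTGAGCAAGT GGCGACAAGC ATTCGAGAAA"
last_line = "GACTCGAAGG AGGAGTGA"

print(translate(first_line))  # MTGRPSMASSLSKWRQAFEK
print(translate(last_line))   # DSKEE
```

Both outputs match the start and end of the protein sequence listed above. The last line can be translated on its own because the preceding 2640 bases are a multiple of three, so it begins on a codon boundary.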