Gene Noca_1747 details

Gene Information

Locus tag: Noca_1747
Symbol:
ID: 4597929
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 1857894
End bp: 1859630
Gene Length: 1737 bp
Protein Length: 578 aa
Translation table: 11
GC content: 69%
IMG OID: 639776347
Product: transcription termination factor Rho
Protein accession: YP_922947
Protein GI: 119715982
COG category: [K] Transcription
COG ID: [COG1158] Transcription termination factor
TIGRFAM ID: [TIGR00767] transcription termination factor Rho
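
The coordinates and lengths listed above are mutually consistent, which a few lines of arithmetic can confirm. This is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1857894
end_bp = 1859630

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A coding sequence is read in triplets; the last codon is assumed to be
# a stop codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1737
print(protein_length)   # 578
```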


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.179757
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAAGG CTCAGCTGGT CGAGGCGATC AAGGCCCACC AGAGCGGCGG CCGGCCGGCC 
AAGGAGCGCG GCGAGCAGCA GCAGGCCCAG CAGGAGCGGA GCGAGCAGCA GCGGACCCAG
CCGGAACAGG CCCCGCAGGA GCGGACCCAG CCGGAGCGAC CGCAGCAGCA GCGGTCGGAG
GAGCAGCCGC GGCGCGACCA GCGGCCCGAC GAGGCCCAGC GGGACCAGCA GCGGGACCAG
CAGCGCGAGC GCAACCGCGG CCAGGGTCAG GGGAACCACG ACCAGAAGCA GGACCAGAAC
AAGCAGGGCC AGCCCAAGCG GCAGGACCAG AAGCAGGCCC AGAACAAGCA GGACCAGAAG
CAGGACCAGG GCAAGCAGGA CCAGAAGCAG GACCAGGGTC ACCAGGATCA GGACCATCAC
GACCAGGGAC ACCAGGACCA GGGGCACCAG GACCAGGGCC GGGCGGGCGA GGACCAGGGT
GAGGGCAGTC GCCGCAACCG GCGGCGTCGC GGTCGCGACC GTGACCGTAC CGGTCGGGGG
GTCACCGGCG GTGGTCAGCG CAACGAGCCG GACACCACGA TCCTCGAGGA CGACGTCCTG
GTGCCGGCCG CGGGCATCCT CGACGTTCTC GACAACTACG CGTTCGTGCG GACCAGCGGC
TACCTCCCCG GCCCTGACGA CGTGTACGTG TCGCTCTCGA TGGTGCGCAA GTTCGGGCTG
CGCCGCGGCG ACGCGCTCGT CGGGCAGGTG CGCCAGCCCC GGGAGGGCGA GCGCAAGGAG
AAGTTCAACC CGATGGTCCG CATCGACAGC GTCAACGGCG CCGATCCGGA GATCGCGAAG
GGGCGGGTCG ACTTCGCCAA GCTGACCCCG CTCTACCCCT CCGAGCGGTT GCGGCTGGAG
ACCGAGCCGA CGAACCTGAT CGGTCGGGTC ATCGACATCG CGGCCCCGAT CGGCAAGGGC
CAGCGCGGCC TGATCGTGTC CCCGGCGAAG GCCGGCAAGA CCATGATCAT GCAGTCGATC
GCGAACTCGA TCACCACCAA CAACCCCGAG TGCCACCTGA TGGTGGTGCT GGTCGACGAG
CGGCCCGAGG AGGTCACCGA CTTCGAGCGC TCGGTCAAGG GTGAGGTCAT CTCCTCGACC
TTCGACCGTC CGGCCAGCGA CCACACGATG GTCGCCGAGC TCGCCATCGA GCGGGCCAAG
CGGCTGGTCG AGCTCGGCCA CGACGTCGTC GTACTGCTCG ACGGCATCAC CCGGTTGGGG
CGCGCCTACA ACCTCGCGAT GCCGGCGAGC GGCCGGATCC TCTCCGGTGG TGTGGACTCG
GCCGCGCTCT ACCCACCGAA GAAGTTCTTC GGTGCGGCGC GCAACATCGA GAACGGCGGC
TCGCTGACCA TCCTCGCCAC GGCCCTGATC GAGAGCGGCT CGAAGATGGA CGAGGTGATC
TTCGAGGAGT TCAAGGGCAC CGGGAACATG GAGATCCGGT TGCGCCGCGA CCTTGCCGAC
AAGCGACTGT TCCCCGCGAT CGACGCGGTC CAGTCCGGCA CCCGCCGCGA GGAGCTCCTG
ATGAGCAAGG AGGAGCTGGC CATCGTCTGG AAGCTGCGCC GGGTGCTCTC CGGGCTCGAC
GGCCAGCAGG CGCTCGAGCT CCTGCTGGAG CGGCTGAAGA AGTCCCAGAC CAACATCGAG
TTCCTGATGC AGGTCCAGAA GACGACCCCG ACCCCGACCG GCGGGCGCGA AGACTGA
 
Protein sequence
MKKAQLVEAI KAHQSGGRPA KERGEQQQAQ QERSEQQRTQ PEQAPQERTQ PERPQQQRSE 
EQPRRDQRPD EAQRDQQRDQ QRERNRGQGQ GNHDQKQDQN KQGQPKRQDQ KQAQNKQDQK
QDQGKQDQKQ DQGHQDQDHH DQGHQDQGHQ DQGRAGEDQG EGSRRNRRRR GRDRDRTGRG
VTGGGQRNEP DTTILEDDVL VPAAGILDVL DNYAFVRTSG YLPGPDDVYV SLSMVRKFGL
RRGDALVGQV RQPREGERKE KFNPMVRIDS VNGADPEIAK GRVDFAKLTP LYPSERLRLE
TEPTNLIGRV IDIAAPIGKG QRGLIVSPAK AGKTMIMQSI ANSITTNNPE CHLMVVLVDE
RPEEVTDFER SVKGEVISST FDRPASDHTM VAELAIERAK RLVELGHDVV VLLDGITRLG
RAYNLAMPAS GRILSGGVDS AALYPPKKFF GAARNIENGG SLTILATALI ESGSKMDEVI
FEEFKGTGNM EIRLRRDLAD KRLFPAIDAV QSGTRREELL MSKEELAIVW KLRRVLSGLD
GQQALELLLE RLKKSQTNIE FLMQVQKTTP TPTGGRED
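
The sequences above are displayed in blocks of ten characters. The following Python sketch, offered as an illustration and not as the database's own method, strips that spacing, computes GC content, and translates a coding sequence. It assumes the codon assignments of translation table 11, which match the standard genetic code (table 11 differs in its permitted start codons), and it handles only unambiguous A/C/G/T bases. The function names are hypothetical; the example input is the first three blocks of the gene sequence.

```python
from itertools import product

# Codon table built in NCBI order (bases T, C, A, G; third base varies fastest).
# Assumption: table 11 uses the same codon-to-amino-acid assignments as the
# standard code, so one string of 64 letters covers both.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on ambiguous bases such as N, which this sketch
    does not handle.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGAAGAAGG CTCAGCTGGT CGAGGCGATC"
print(translate(first_blocks))  # MKKAQLVEAI
```

Passing the full gene sequence to `translate` would be expected to reproduce the protein sequence listed above, and `gc_content` to land near the 69% reported in the record. Neither full-length result is asserted here.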