Gene Noca_4920 details

Gene Information

Locus tag: Noca_4920
Symbol:
ID: 4595296
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008697
Strand:
Start bp: 252051
End bp: 253688
Gene Length: 1638 bp
Protein Length: 545 aa
Translation table: 11
GC content: 74%
IMG OID: 639772703
Product: LuxR family transcriptional regulator
Protein accession: YP_919363
Protein GI: 119714221
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG2197] Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain
TIGRFAM ID:
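
The start, end, gene length and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length counts one trailing stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(252051, 253688)
print(length)                  # 1638
print(protein_length(length))  # 545
```

Both results agree with the Gene Length and Protein Length fields of this record.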


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGGCAACGA TTGAGCGCCT CGCAACCGCG CGTGCTGCCG CCGAGGCCGG ACGGTGGGAG 
GAGGCGCGGG CAGCGTTCGC CGCGGCGGCT GCCGTGGAAG AGTCCGCGGA CGCGGTCGAC
GGACTGGGCC GGGTGTGTTG GTGGCTGGGC GACGTGCGCT CCGCGATCCG CCACCGGGAG
CGTGCCTTCA CTCTGCTGCG CCAAGCGGGC CGTGACGACG AGGCGACCAT CGCGGCCCTC
GACCTGTGCA TCTGGTACCT CACGAACCTC GAGAACGAGG CGGCCGCCGG CGGCTGGCTG
GCCCGCGCCG CACGGGCCGC CGAGCACACC ACGGACCCGA TGGTCCGGGG CTGGCTGGTA
TTGATCGGCG CCTACCTGTG CTCCGACGCC ACCGCGCGAC GTGCCGGGCT CGAGGAGGCC
CTCCGGATTG CGGCCGAGGC CTCTGACGAC GGGCTGCACG CGATGGCGCT GGCCGACCTG
GGTGTGCTCC TCGTGGCTGG CGGCGAGGTG GAGCACGGGA TGGCGCTCCT CGACGAGGCG
ATGGCGACCA CCCTGGGTGG CTTCGGCGGC CGGCTGGAGG TCGTGGTCTG GTCGAGCTGC
AACATGCTCG CCGCGTGTAG CCTGGCCCAG GACCTGCGAC GCGCGACGCA GTGGTGCCGG
GTGGCCGAGG AATTCACCCA GACTTACGGC TGCCCTTTTC TTCAAGCCAG GTGTCGAGCC
CACTACGGCT CGGTCCTGGT CGCCGCCGGC ACCTGGGATC TCGCTGAGCC CGAGCTCCGG
CGGGCAATTT CCATGTCGGA GGACGTCGGG CGCCAGCCCC TGCTCGAGGC GAGGACCGCA
CTTGCCGCCC TCCGGCTCCG GCAGGGTCGG CTCGGCGAAG CCACCGAACT CGCCGAGGAG
CTCGATACGA ACTCCCCCGC CGCCGCCCTC GTATCGGCCG AGGTGCGGCT CGCCGCCGGC
AGGCCGGACG AGGCCGCGGC GCTGCTCCGC GCCGCGCTCG GGCTGCTCCA TCCCGACGAT
CCGCAGAGCG ACCCGCTCGC GGCCGCCCTG TGTGAGGCGT ACCTGGCCAC CGGCGACATT
GCGGGTGCCG AGGCGGCGCT GGCGGGCAGC CGGGCCGAGC CGCCGAGGCC CGCACTGCCC
CGAGGGAGTG CTCAACGGAC CCGCAGCGCG GGGCTGGTCG CGGCGGCATC GGGCGACGCT
GCGACCGCGG TGCGACGGCT GGCCGAGGCG CTCGCCGCCT TCGAGCGGCA CGACCTGCCC
TTCGAGGCGG CCCGGACCCG GCTGGATCTG GCCAGGGCCC TCGCCACGCA CGATCCCGAA
GCCGCCGCGG GCTACGCCAC CGAAGCGCTT CGCGCCCTCA GGCGGCTGGG CGCAGCCGGC
GAAACGGCCG CAGCGGCCGC GTTGCTCCGC CAGCTCGGAG TCACCCCCGG GCCGGAGCCC
CGGGACCCCG GGGTGCTCAC CAGGCGCGAG CACGACGTGC TCACCCTGCT CGCCGACGGA
CTCAGCAACC CGGAGATCGC GCAACGCCTA TACCTGAGCC GCAAGACCGT GGCGCACCAC
GTGAGCAGCA TCCTGACCAA GCTTGCCCTC CGCTCCCGCG CCGAGGCCGC CGCCTTCGCC
ACGCGGACCC GGGGGTGA
 
Protein sequence
MATIERLATA RAAAEAGRWE EARAAFAAAA AVEESADAVD GLGRVCWWLG DVRSAIRHRE 
RAFTLLRQAG RDDEATIAAL DLCIWYLTNL ENEAAAGGWL ARAARAAEHT TDPMVRGWLV
LIGAYLCSDA TARRAGLEEA LRIAAEASDD GLHAMALADL GVLLVAGGEV EHGMALLDEA
MATTLGGFGG RLEVVVWSSC NMLAACSLAQ DLRRATQWCR VAEEFTQTYG CPFLQARCRA
HYGSVLVAAG TWDLAEPELR RAISMSEDVG RQPLLEARTA LAALRLRQGR LGEATELAEE
LDTNSPAAAL VSAEVRLAAG RPDEAAALLR AALGLLHPDD PQSDPLAAAL CEAYLATGDI
AGAEAALAGS RAEPPRPALP RGSAQRTRSA GLVAAASGDA ATAVRRLAEA LAAFERHDLP
FEAARTRLDL ARALATHDPE AAAGYATEAL RALRRLGAAG ETAAAAALLR QLGVTPGPEP
RDPGVLTRRE HDVLTLLADG LSNPEIAQRL YLSRKTVAHH VSSILTKLAL RSRAEAAAFA
TRTRG
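
The sequences above are displayed in blocks of 10 residues, so they need whitespace removed before any analysis. The sketch below shows one way to do that, compute a GC fraction, and check that a coding sequence begins with a start codon and ends with a stop codon of translation table 11 (under which GTG, the first codon of this gene, is a permitted start and is read as methionine). All function names are hypothetical helpers introduced for illustration.

```python
# Start and stop codons of NCBI translation table 11
# (bacterial, archaeal and plant plastid code).
TABLE_11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
TABLE_11_STOPS = {"TAA", "TAG", "TGA"}


def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence shown in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def has_valid_ends(dna: str) -> bool:
    """True if the CDS is a whole number of codons with a table 11 start and stop."""
    return (
        len(dna) % 3 == 0
        and dna[:3] in TABLE_11_STARTS
        and dna[-3:] in TABLE_11_STOPS
    )


# Usage on the first two blocks of the gene sequence.
fragment = clean_sequence("GTGGCAACGA TTGAGCGCCT")
print(len(fragment), fragment[:3] in TABLE_11_STARTS)
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the GC content field of the record.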