Gene Noca_3945 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3945 
Symbol 
ID4598080 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4154606 
End bp4156882 
Gene Length2277 bp 
Protein Length758 aa 
Translation table11 
GC content78% 
IMG OID639778550 
Producthypothetical protein 
Protein accessionYP_925129 
Protein GI119718164 
COG category[K] Transcription 
COG ID[COG2378] Predicted transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.404517 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCACGA CCGGGTTCCG CACCCTCGCC GACCAGCTGC GCAGCTGGCC GGACGAGCGG 
CTGTCGCGCC TGCTCGTCGC CCGGCCCGAC CTCGCCACGC CGGCGCCGCA CGACTCGGGC
CAGCTGGCCT CGCGTGCCGC GACCCGGTCC TCGCTGCTGC GGGCGCTCGA CCAGCTGACC
CGTGCCGAGC TCGCCGTGCT CGACGCGCTC GTCGTCGTCG GCCAGACCAC GCGCAGCGAG
CTGGCCGAGG TGGTGCGCGC GAGCACCGCC TCCGTCGCCG CGGCGGTCGA CCGGCTGCTC
GACCTCGCCC TGGGCTGGGA GGCCCCCGCG GGCCTGCGGG CGCTCAGCGC CGTGGCGGAC
ACGCTCGCCC AGAGCGGGGG CGCGGCGGGC GCCAGCGGGC TGCGCCCGGT CTCCGAGGAC
CCGCCACCAC CCGAGGAGGT ACGCCGGCGC CTGGACGAGC TGTCCCCCGC CGCGCGGGCG
CTGCTCGAGC ACGTGCTGGA CTCCGGCGGG GAGGCGACCA CTGGGTCGGC CCGGCACACG
GTGCTGCCCG AGGAGGCCGC GAGCCCCGCG GAGGAGCTGC TCGCCCGGCG GCTCCTCGTG
CCGCGCTCCG GGGGCGTGGT CGTGCTGCCC GGTGAGGTCG GCATCGCATT GCGCGGTGGG
CGGACGACCA CCGAGCCGAT CGACGAGGAG CCCGCGCTGG CCACCTCCGA GCGTGGTCCC
GCGATGGTGG AGCGCGCCGG GTCGGGCGCC GCCTTCGACG CGGTGCGCCG GGTCGAGCTG
CTGCTCGACC ACTGGGGCCT CCAGCCACCG ACCGCGCTGC GCAGCGGCGG GCTCGGCGTC
CGGGACCTGC GCGCGACGGC CGCCCACCTC CACGTCGACG AGCCGACCGC TGCGCTGCTG
GTCGAGGTCG CCGCGGCAGC CGGCCTGCTC GCGACCGCCG CGGACCCGGA CGGCAACCCT
GCTTGGATCC CCACCGACGC CTTCGACCAG TGGGCCGCGC AGCCTGTGGT CGCCCGGTGG
GGGGCCCTGG TCCGGGCCTG GCTGGACAGC CCGCGGCTGC CCGGCCTGGT CGGCACGCGG
GACCCGGCCG GCAAGGCCTG GAACGCGCTG GCCCCCGAGC TGGCCGGCGT GCACCAGGTC
GAGACCCGGC GGATGACCCT CGACGCGCTG CTCGAGCTGC CCGTGGGCGA GGTACTCGCG
ACCGGCACCG GGCTGCCGTC GCTGGTGGCC CGGATCGCCT GGCGCCGGCC GCGCCGGCCG
CGGACCCGCG CCGACCAGGT GGGCTGGGCG GTCGCCGAGG CGACCGCCCT CGGGGTCGTG
GCGCTCGGCG GGCTGCCGGC GTACTCCCGC CTCCTGCTCG GCGGCGACGA GCCCGCCGGG
CTGGCCGCGC TCGCGGAGCT GATGCCCGAG CCGGTCGACC ACGTGCTACT CCAGGCCGAC
CTGACCGCCG TCGCGCCGGG GCCGTTGGAG TCGCAGCTCG CCCGCCGCCT GCAGGTGGTC
GCCGACGTGG AGTCGCGCGG CGGGGCGACG GTCTACCGGT TCACGCCCGA CTCGGTACGC
CGCGCCCTCG ACACCGGCTG GACCGCGGCG GAGGTGCACG CCTTCGTCAC CACGGTCTCG
CGGACGCCGG TCCCCCAGCC GCTCACCTAC CTCGTCGACG ACACCGCCCG CACGTTCGGC
AGCATCCGGG TCGGCCACGC CGAGGCGTTC CTGCGCGCCG ACGACGAGGC CGCGCTCACC
GAGCTGCTCC ACCACCCGCG GTCGGCCGCG CTCGGGCTGC GCCGGATCGC GCCGACCGTC
CTGGTCAGCT CGACCCCGCT GGACGTGCTG CTCCCCCGGC TGCGCGAGCT CGGCGCCGCG
CCCGTGGTGG AGGCCGCCGA CGGCACCGTG CACGTCGCCC GCCCCGACCT GCTGCGGGCG
CGCACCCCCC GCGAGCACCG CGCCCGGGCC GCGCGGACCG CCCGGGAGAC GGCCCACGCC
GCGCACGCCG TCACCGCGAT CCGGGCCGGC GACCGCGCCG AGTCGAGCCG GCCCGACAGC
CCGGCGCGGG TCCTCACGCC GAGCGGCTCG ATGGCGGCGC TACGCGAGGC GGCCGAGGCC
GGCGAGACGG TGCTGATCGG CTACCTCGAC AACCAGGGCA CCCGCTCGGA GCGCCTGGTG
GACCCGGTCC GGGTCGAGGG CGGGTCGCTG ACGGCGTACG ACCACCGCTC CGACGACGTC
CGCACGTTCG CGGTGCATCG CATCACCACG GTCCGCGCAG TGACCGCCGA TCCGTAA
 
Protein sequence
MSTTGFRTLA DQLRSWPDER LSRLLVARPD LATPAPHDSG QLASRAATRS SLLRALDQLT 
RAELAVLDAL VVVGQTTRSE LAEVVRASTA SVAAAVDRLL DLALGWEAPA GLRALSAVAD
TLAQSGGAAG ASGLRPVSED PPPPEEVRRR LDELSPAARA LLEHVLDSGG EATTGSARHT
VLPEEAASPA EELLARRLLV PRSGGVVVLP GEVGIALRGG RTTTEPIDEE PALATSERGP
AMVERAGSGA AFDAVRRVEL LLDHWGLQPP TALRSGGLGV RDLRATAAHL HVDEPTAALL
VEVAAAAGLL ATAADPDGNP AWIPTDAFDQ WAAQPVVARW GALVRAWLDS PRLPGLVGTR
DPAGKAWNAL APELAGVHQV ETRRMTLDAL LELPVGEVLA TGTGLPSLVA RIAWRRPRRP
RTRADQVGWA VAEATALGVV ALGGLPAYSR LLLGGDEPAG LAALAELMPE PVDHVLLQAD
LTAVAPGPLE SQLARRLQVV ADVESRGGAT VYRFTPDSVR RALDTGWTAA EVHAFVTTVS
RTPVPQPLTY LVDDTARTFG SIRVGHAEAF LRADDEAALT ELLHHPRSAA LGLRRIAPTV
LVSSTPLDVL LPRLRELGAA PVVEAADGTV HVARPDLLRA RTPREHRARA ARTARETAHA
AHAVTAIRAG DRAESSRPDS PARVLTPSGS MAALREAAEA GETVLIGYLD NQGTRSERLV
DPVRVEGGSL TAYDHRSDDV RTFAVHRITT VRAVTADP