Gene Noca_4968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4968 
Symbol 
ID4595339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008697 
Strand
Start bp301532 
End bp302650 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content71% 
IMG OID639772750 
Producthelix-turn-helix domain-containing protein 
Protein accessionYP_919410 
Protein GI119714268 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2856] Predicted Zn peptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones51 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0119707 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGAGG ACGGCCCAGG CTTGTTCGAA CTCTCGCCCG GGAGGAGCGG CCGGTTCGAG 
CCGGCACGGC TGACCCAGGC GCGAGCGCGC CTCGGCGTCA GCAAGGCCGA CCTGGCTTCC
TCCGCCGGCG TCTCGGCCGC GGCGATCGGC CAGTATGAGG CCGGCGTGAC CTCGCCACGC
CCTGAGGTCG TCGATCGGCT GGCGGAGGCT CTCGAGGTCC GTCCCGGGTT CTTCGACGTC
GGCCGCCCGC TGGCGCGCAT CGACACGGTG AACGCGCACT TCCGTAGCCT GAGGTCAGTC
CGCGTCAGTG ATCGCCAGAA GGCCTTGGCC ACCGCCACGT TCGTGTGGGA GATGACCTTC
GCGCTTGAGC GGTACGTCAA GCTGCCCGAG GTGGACCTCC CTTCCCTCCC GGTCGGGACA
ACGCCGACCG AGGCCGCGGC GGCCCTGCGG CGGCACTGGG ACTTGCCTGA CGGGCCAGTG
AAACACCTTG TCGCCACGGC GGAGTCGCAC GGCGTCGTCG TTGCTGTGCG CCCGCTACGC
GAGATCGACG CCGTCGACGC CTTCTCTGCG GTCATCGTCG ATCGGCCGGT CATCATCACC
ACGCCGCGGC GCAGCGAGAA CGTGTTCCGG CACAGGTTCT CAATCGCCCA CGAGATCGGG
CACCTGTTGC TGCACGGCGA CTCCGGCGAG TACAGCGCGG CGGTCGAAAA GGAGGCCGAC
GAGTTCGCCG CCGCGTTCCT GACGCCGGCG GCCGCCATGG ACGCGGCGCT GCCGCAGCGG
CTCGAGCTGG CGGCACTGGA CCGGCTCGGT CGGACGTGGG GCGTTTCGCC GAAATCGCTG
GTGCGCCGGA TGGTCGAGCG CGGGCGCACC ACCGAGTCGT CGGCACGGCG GGCCTACCAG
CGCCTGGCCA TGACCGACGA CCCGTCGGCC GACCCGACCA GGGCGTACCC GGGCGAAATG
CCATCACTGC TGAAGAAGGC CGCGGACATG GCGGGCGACC TCGGCGCGGG AGTGCCTGCC
CTCGCCGAGG CGCTGAAGCT CAGGCCCGTG CAGGTGCGTG ACCTGCTCGG CGACGCCGAC
CAGCGGCCGG TCCTACGCCT TGTCGACGGC CGGGGCTGA
 
Protein sequence
MPEDGPGLFE LSPGRSGRFE PARLTQARAR LGVSKADLAS SAGVSAAAIG QYEAGVTSPR 
PEVVDRLAEA LEVRPGFFDV GRPLARIDTV NAHFRSLRSV RVSDRQKALA TATFVWEMTF
ALERYVKLPE VDLPSLPVGT TPTEAAAALR RHWDLPDGPV KHLVATAESH GVVVAVRPLR
EIDAVDAFSA VIVDRPVIIT TPRRSENVFR HRFSIAHEIG HLLLHGDSGE YSAAVEKEAD
EFAAAFLTPA AAMDAALPQR LELAALDRLG RTWGVSPKSL VRRMVERGRT TESSARRAYQ
RLAMTDDPSA DPTRAYPGEM PSLLKKAADM AGDLGAGVPA LAEALKLRPV QVRDLLGDAD
QRPVLRLVDG RG