Gene Noca_2365 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_2365 
Symbol 
ID4595980 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp2520447 
End bp2521682 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content67% 
IMG OID639776964 
ProductHipA domain-containing protein 
Protein accessionYP_923557 
Protein GI119716592 
COG category[R] General function prediction only 
COG ID[COG3550] Uncharacterized protein related to capsule biosynthesis enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0940993 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTTCTG AGCCGTACCG GGCCCGTCGC GTCTTCGTCT GGACCTGGCT GCCCGGCGAG 
TCGACGCCCG TGGTGGCCGG GGCGGTCGAC CGCGTCGGCG CCGATCGTCT GGACTTCACC
TACGCCCGGT CCTACCTCGA CAACGAGAGG GCGATCAGCC TCTACACGCC CGAGTTGCCG
TTGCGTCGCG GAACCCAGGA GCCGATGGAC GGCCTGACGG TCGCTGCTTG TCTCAGGGAC
GCGACACCGG ACTCCTGGGG TGAGCGGGTG ATCGGCAACC GCCTCGGCTC GGGCGACACT
GAGCTCAGCG TCGAGACCTA CATGTTGGAG TCCGGCTCGA ACCGACTTGG TGCCATCGAC
TTTCAGGAGA GCCCCGAGGA CTACAGCCCG CGCGTGGACA CAGCCGGCCT GGACGAGCTG
TACGACGCCG CCGAGAAGGT CCTTGCTGGG GAGCCGTTGA ACCCGGCCAT CGGCGATGCT
CTGATGAACG GGACCGCCAT CGGTGGCGCG CACCCGAAGG TCCTGATCAG CGACGACTCG
GGAGTCGAGC ACCTCGCCAA GCTGTCGGTC TCCAGCGACG TGCATCCATG GGTTCGAGCC
GAGGCCGTCG CAATCGAGCT CGCACGACTC TGCGGCATCG AGGTGCCCAA GGCGCACGTC
ATCAAGTCGG TGGGCCGGGA GGTGCTGCTC ATCGAGCGGT TCGACCGTCC ACCCGGTGGC
CGGCGCCGTC ACGTCGTCTC GGGTCTGACG ATGCTCGGTT TCGACGCGCT CCTGGGTGCG
CGCTACGGGT CGTACCCCGA GATGCTCGAC GTCCTTCGTG AGTGGGGCCG CGCTCCACAA
GACATCGGCC ACCGGCTCTT CGAGCGCATC GTCTTCAACA TCGCGGTCGG CAACAACGAC
GATCATGCAC GGAACCACGC AGCGTTCTGG GACGGCACGA GCCTCGAGCT CACCCCGGCG
TTCGACCTGA CCCCGCAGCC CCGTTCGACC GACACGTCTG CCCAGGCGAT GGCCATCGGC
CGTGACGGAA GCCGAGCCAG CCGGTTCTCC GTGTGCGTGG CCGCGGCCGC TGACTACGGA
CTCTCCCGAG CCGAGGCAGG GCAGATCGTC GAGCGCATTG TCGCGACGGT CGAGAACCAC
TGGTCCGAAG CTGCGGACGC CGCCGGGCTG AGCGAGGCGG ATCGGAACCT GCTCTGGAAG
CGGTCCATCC TCAACCGTTC GTCCTTCTAC GACTGA
 
Protein sequence
MTSEPYRARR VFVWTWLPGE STPVVAGAVD RVGADRLDFT YARSYLDNER AISLYTPELP 
LRRGTQEPMD GLTVAACLRD ATPDSWGERV IGNRLGSGDT ELSVETYMLE SGSNRLGAID
FQESPEDYSP RVDTAGLDEL YDAAEKVLAG EPLNPAIGDA LMNGTAIGGA HPKVLISDDS
GVEHLAKLSV SSDVHPWVRA EAVAIELARL CGIEVPKAHV IKSVGREVLL IERFDRPPGG
RRRHVVSGLT MLGFDALLGA RYGSYPEMLD VLREWGRAPQ DIGHRLFERI VFNIAVGNND
DHARNHAAFW DGTSLELTPA FDLTPQPRST DTSAQAMAIG RDGSRASRFS VCVAAAADYG
LSRAEAGQIV ERIVATVENH WSEAADAAGL SEADRNLLWK RSILNRSSFY D