Gene Noca_1954 details

Gene Information

Locus tag: Noca_1954
Symbol:
ID: 4599860
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 2085206
End bp: 2086603
Gene Length: 1398 bp
Protein Length: 465 aa
Translation table: 11
GC content: 69%
IMG OID: 639776553
Product: putative lipoprotein
Protein accession: YP_923151
Protein GI: 119716186
COG category:
COG ID:
TIGRFAM ID:
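
The coordinates and lengths listed above are mutually consistent, which a little arithmetic can confirm. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon (the record states neither explicitly). The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2085206, 2086603)
print(length, protein_length(length))  # 1398 465
```

Under these assumptions the result agrees with the reported 1398 bp and 465 aa.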


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0503014
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCAAGC AGGAAGACCC TCACCACGTA CCACCACCCG ATCCTCGGGG CCGCGGTCGC 
GGCCCGCGAC GCTGGCTCGC CACCGTGGGC ATCCTCGCCC TCGGTGGGAT GGTGGCCCTG
GCCTACCAGG CAACGGCGGG TCCGCCGGCG GCGAAGCCGG CGACCGTCCA GAACCCCCCG
TCCGCCGGCG ACAACGCGCA GGCGATGGTC AAGGAGGGTC GACACACCTT CCGCTACGAC
ACGTTCGGGG ACCAGGCGTT CTGGGGCGGC ACGCTCCAGC TCCACGACGC GATCGCGGGG
GAGGACAACG GCGGCGTCGG TGGCGGCGTC AGCCCCAAGA CGGCGCTGGC CGTCGGGCTC
AAGGTCGACG TCAAGCGGCT GCCCGCCAGC GTCAAGAACG CACTCGCGAA CGGCAAGGTG
AACCTCGACG ACCCGGCGGT CACCCTGGCC CTGTTGAAGC TCAACTCGGT GGTCGGCGTG
AGGGGGTTCT TCAACTCCGA CGGCACCCTG CGGACCGTCG GCATCGAGTG CGCGCTGTGC
CACTCGACGG TCGACGACTC CTTCGCACCG GGCATCGGGA ACCGGCTCGA CGGCTGGGCC
AACCGGGACC TGAACGTCGG GGCGATCGTG TCCCTCGCGC CGAACCTGCA GCCGATCGCC
GATCTGCTGC ACACCGACGT CGACACGGTC AAGCAGGTCC TGGCCGCGTG GGGTCCGGGC
CGGTTCGACG CGCAGCTGTT CCTCGACGGG AAGGCCTTCC GTCCGGACGG TACGACGGCG
GCCACGGTGC TGCCGCCCGC GTTCGGGCTG CAGGGTGTCA ACCAGCACAC CTCGACCGGG
TGGGGCTCGG TGACGTACTG GAACGCCTTC GTGGCGAACC TGGAGATGCA CGGTCAGGGC
AACTTCTACG ACCCGCGTCT CGACAACGCC GACCAGTTCC CGATCGCGGC GGAGAACGGC
TTCGGCCACG TGCGGTCGAA GGTCGACAAG ATCTCGTCGA AGCTGCCGGC GCTCGCCGCC
TACCAGCTGT CGCTGACCGC GCCGACGCCG CCGAAGGGCA GCTTCGACCC GAAGGCGGCG
GCCCGCGGTG AGTCGCTGTT CATGGGACAG GCCCAGTGCT CGACCTGCCA CGTCCCGCCG
ACGTTCACCG AGCCGGGGTT CAACATGCAC ACCGGTGAGG AGATCGGGAT CGACAACTTC
CAGGCCGATC GCTCGCCGAC GCACATGTAC CGCACCAGCC CGCTCAAGGG TCTGTGGAGC
CACCAGAAGG GCGGCTTCTA CCACGATGGT CGGTTCCCGG AGCTGGTCGA CGTCGTCCAG
CACTACAACG ACACCTTCGG CCTGGGCCTC ACCGAGGCCC AGCAGGGCGA CCTCGTCCAG
TACCTGAAGT CGCTCTGA
 
Protein sequence
MVKQEDPHHV PPPDPRGRGR GPRRWLATVG ILALGGMVAL AYQATAGPPA AKPATVQNPP 
SAGDNAQAMV KEGRHTFRYD TFGDQAFWGG TLQLHDAIAG EDNGGVGGGV SPKTALAVGL
KVDVKRLPAS VKNALANGKV NLDDPAVTLA LLKLNSVVGV RGFFNSDGTL RTVGIECALC
HSTVDDSFAP GIGNRLDGWA NRDLNVGAIV SLAPNLQPIA DLLHTDVDTV KQVLAAWGPG
RFDAQLFLDG KAFRPDGTTA ATVLPPAFGL QGVNQHTSTG WGSVTYWNAF VANLEMHGQG
NFYDPRLDNA DQFPIAAENG FGHVRSKVDK ISSKLPALAA YQLSLTAPTP PKGSFDPKAA
ARGESLFMGQ AQCSTCHVPP TFTEPGFNMH TGEEIGIDNF QADRSPTHMY RTSPLKGLWS
HQKGGFYHDG RFPELVDVVQ HYNDTFGLGL TEAQQGDLVQ YLKSL