Gene Noca_3907 details

Gene Information

Locus tag: Noca_3907
Symbol:
ID: 4598042
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 4112019
End bp: 4113107
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 73%
IMG OID: 639778513
Product: hypothetical protein
Protein accession: YP_925092
Protein GI: 119718127
COG category: [A] RNA processing and modification
COG ID: [COG5178] U5 snRNP spliceosome subunit
TIGRFAM ID:
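
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption that is consistent with the listed gene length) and that the final codon of the CDS is a stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(4112019, 4113107)
print(length, protein_length_aa(length))  # prints: 1089 362
```

Both results agree with the Gene Length and Protein Length fields of this record.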


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.837728
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTCGGGCG ACCAGTACCC TCCCCCGGGC GGATCGCCCC CGAGCCCGCC CCCGCCCGGA 
TGGCAGCCAC CGCCGCCCCC GACACCCGTC CCGCCGGCGC CCGGCTGGGC GCAGTCGCCA
GTCGTCGCGC CGGGCATGCT GGGCGCCGCA CACAAGCCGG GTGCGATGCC GCTGCGCCCG
CTCGCGCTCG GCGACATGTA CGACGCGGCG TTCCGGATCA TCCGGTTCAA CCCGAAGGCC
ACGGTCGGCT CCGCGGTGCT CGTCGCGGCG GTCGCGATGG GCGTCCCGGT CCTGGTCACC
GCGCTGCTGA CGCTGGTCGT GGATCTCTCC GCCGCACAGT CCGGCGACGA CCTCTCCACT
GCCGAGGTCG TCGGCTACGC CGGGTCGATC GGCTCGCTGT TCCTCGGCAC CGCGCTGCAG
TCGGTGGGCT CGATCCTGGT CACCGGCATG GTCGCCCACG TGACGGCGGC CGCGGCGATC
GGCCGCCGGC TCACCCTCGG CGAGGCCTGG GCCGCGACCC GGGGCTCGCG CTGGCGGCTG
GTCGGGCTGA CACTGCTGCT GGGCCTGATG CTGGCCGGGC TGCTGCTCGC CTACGGCCTG
CTCTGGATCC TGGCGGTCGT GCTGCTGCCG ACCTGGGCGA TCGTCCTGTT CGGCGTGCTC
AGCGTCCCGG CGTTCCTCGC GTTCGCCTGC TGGTTCTGGA TCCGGGTCTA CTACCTGCCG
GTGCCGGCCC TGATGATCGA GCGGACCCGG GTGCTGGCCG CGATCGGGCG GGGCTTCCGG
CTGACCCGGC GGCAGTTCTG GCGGACCTTC GGGATCGCGC TGCTGACCGT GATCGTCACG
GGGATCGCCG GGAGCATGCT GTCGGCGCCG TTCACGATCG CCGCTCAGCT GCTGCCGCTG
GCGATGGCCG AGTCCCGCTA CGCCGTACTC GTCCTGGTCG TGCTGAGCGC GATCGCCACC
GTCGTCCAGA CCGCGTTCGT CACCCCCTTC ACCTCCGCGG TCACGAGCGT GCAGTACCTC
GACCAGCGGA TCCGCAAGGA GGCCTACGAC GTCGAGCTGA TGACCCAGGC CGGGATCACC
GCGTCGTGA
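
The gene sequence is printed in blocks of 10 bases, so whitespace has to be removed before it is measured. As a minimal sketch, the helpers below strip the formatting and compute the GC percentage of whatever sequence text is passed in. The function names are hypothetical, and the short string used here is only the first two blocks of the sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


first_blocks = "GTGTCGGGCG ACCAGTACCC"  # first two blocks of the gene sequence
print(len(clean_sequence(first_blocks)), gc_percent(first_blocks))
```

Passing the full gene sequence text to `clean_sequence` and `gc_percent` gives values that can be compared with the Gene Length and GC content fields.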
 
Protein sequence
MSGDQYPPPG GSPPSPPPPG WQPPPPPTPV PPAPGWAQSP VVAPGMLGAA HKPGAMPLRP 
LALGDMYDAA FRIIRFNPKA TVGSAVLVAA VAMGVPVLVT ALLTLVVDLS AAQSGDDLST
AEVVGYAGSI GSLFLGTALQ SVGSILVTGM VAHVTAAAAI GRRLTLGEAW AATRGSRWRL
VGLTLLLGLM LAGLLLAYGL LWILAVVLLP TWAIVLFGVL SVPAFLAFAC WFWIRVYYLP
VPALMIERTR VLAAIGRGFR LTRRQFWRTF GIALLTVIVT GIAGSMLSAP FTIAAQLLPL
AMAESRYAVL VLVVLSAIAT VVQTAFVTPF TSAVTSVQYL DQRIRKEAYD VELMTQAGIT
AS
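
The protein sequence can be related back to the gene sequence by translating codons. The sketch below builds the standard codon assignments, which translation table 11 shares with table 1 for elongation, and reads an alternative start codon such as GTG as methionine. The set of start codons used here (ATG, GTG, TTG) is a simplifying assumption covering the most common bacterial initiators, not the complete list for table 11, and `translate_cds` is an illustrative name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: only the most common bacterial start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(text: str) -> str:
    """Translate a CDS, reading a leading start codon as M and stopping at a stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate_cds("GTGTCGGGCG ACCAGTACCC TCCCCCGGGC"))  # prints: MSGDQYPPPG
```

The output matches the first block of the protein sequence, and the final nine bases of the gene (GCGTCGTGA) translate to the closing residues AS followed by a stop codon.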