Gene Noca_2040 details


Gene Information

Locus tag: Noca_2040
Symbol:
ID: 4598663
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 2183254
End bp: 2184510
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 73%
IMG OID: 639776643
Product: hypothetical protein
Protein accession: YP_923236
Protein GI: 119716271
COG category: [R] General function prediction only
COG ID: [COG1253] Hemolysins and related proteins containing CBS domains
TIGRFAM ID:
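
The coordinates and lengths in this record are consistent with each other, which can be checked with a short calculation. The sketch below assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; neither assumption is stated in the record itself, and the variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 2183254
end_bp = 2184510

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length_bp, 3)

# Assumption: exactly one terminal stop codon that is not part of the protein.
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)  # 1257 0 418
```

Under these assumptions the result matches the listed gene length of 1257 bp and protein length of 418 aa.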


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.923068
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGACACCC AGACCCTGAT CAACCTCGCC CTGGTCCTGT TGTTCATCCT GGTCGGCGGG 
GTGTTCGCCG GCACCGAGAT CGCGCTGGTC TCGCTGCGCG AGGGCCAGAT CAACACGCTC
GCCTCGCGCG GCGCGCGCGG CGCCAGGGTG GCCTCCGTGG CCCGCGACCC GAACCGGTTC
CTCGCCGCGG TCCAGATCGG CGTCACGCTG GCCGGCTTCT TCTCCGCGGC GTACGGCGCC
TCCACCCTCG CGCCCGACTT CGCGCCGGTG CTCGAGCACG CCGGGCTCGG CGCGGACGCG
GCCGACACCG CCGCGCTGGT GCTGCTGACA CTGTTCATCG CCTACCTCTC CCTGGTGTTC
AGCGAGCTGG TCCCCAAGCG GCTCGCCCTC CAGCGCGCGG CCGGCGTGTC CTACCTCGTC
GGCGCGCCGC TGGACCGGTT CGCGACCCTC ATGCGCCCGG TGGTGTGGCT GCTCTCGGTC
TCCACCAACG CGGTGGTCCG CCTCTTCGGC GGCGACCCCG GCGCGGCCGC CGAGGATCTC
AGCGACGAGG AGCTGCGCTA CCTGGTCGAC CAGCACGAGG GCCTCGCCGA GGACGAGCGG
CGGATCCTCG CCGACGTCTT CGACGCCGGC GACCGGTCCC TGAGCGAGGT GATGCGGCCC
CGCGGCGACG TGACGTTCCT GGCCGGCGAC GCCACGGTCG CCGACGCGAT CGCCATCGCG
CTGACCAGCC CGTACTCCCG CTACCCCGTC ACCGGCACCG GCCACGACGA CATCCGCGGC
TTCCTGCACG TGCGCGACCT GCTGGGCGCC GACCCCCGCA AGCGGGTGCG CTCGATCACC
CGCAAGATCC TGCACCTGCC CGCCACCAAC CGGGTGCTCC CCTCGCTCTC CCGGATGCGG
GCCGAGGGCA GCCACATCGC CGTCGTCGTC GACGAGTACG GCGGCACCGA CGGCATCGTC
ACCCTCGAGG ACCTGGTCGA GGAGCTGGTC GGCGACATCC ACGACGAGTA CGACGAGCGG
GCGAGCGTGG CGGCCGGCGA GGTGGACGCG GGGCTGACCA TCGAGGAGTT CGGCGAGCGC
ACCGGCGTCG AGCTCGAGGA CGGCCCCTAC GAGACCGCCG CCGGGTACGT CGTGCACCGG
CTCGGCCGGC TCGCCGTGGC CGGGGACGTG GTGGCCGTGG GCGAGCACGA GATCGAGGTC
GCGACCGTCG ACAAGCACCG GATCACCCGG CTGCGGGTGC GCCCGCGCGA GTCCTGA
 
Protein sequence
MDTQTLINLA LVLLFILVGG VFAGTEIALV SLREGQINTL ASRGARGARV ASVARDPNRF 
LAAVQIGVTL AGFFSAAYGA STLAPDFAPV LEHAGLGADA ADTAALVLLT LFIAYLSLVF
SELVPKRLAL QRAAGVSYLV GAPLDRFATL MRPVVWLLSV STNAVVRLFG GDPGAAAEDL
SDEELRYLVD QHEGLAEDER RILADVFDAG DRSLSEVMRP RGDVTFLAGD ATVADAIAIA
LTSPYSRYPV TGTGHDDIRG FLHVRDLLGA DPRKRVRSIT RKILHLPATN RVLPSLSRMR
AEGSHIAVVV DEYGGTDGIV TLEDLVEELV GDIHDEYDER ASVAAGEVDA GLTIEEFGER
TGVELEDGPY ETAAGYVVHR LGRLAVAGDV VAVGEHEIEV ATVDKHRITR LRVRPRES
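
The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG can act as an alternative start codon and is read as methionine at the initiation position. The following is a minimal sketch of that relationship, using only the first 30 and last 9 bases of the gene sequence above. The codon table is deliberately partial (only the codons needed here), and the `translate` helper and its `is_cds_start` parameter are hypothetical names introduced for illustration.

```python
# Partial codon table: only the codons that occur in the two fragments below.
CODONS = {
    "GTG": "V", "GAC": "D", "ACC": "T", "CAG": "Q", "CTG": "L",
    "ATC": "I", "AAC": "N", "CTC": "L", "GCC": "A",
    "GAG": "E", "TCC": "S", "TGA": "*",
}

# Subset of the start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna, is_cds_start=False):
    """Translate a DNA fragment codon by codon, stopping at a stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            aa = "M"  # an initiating codon is read as methionine
        else:
            aa = CODONS[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


head = translate("GTGGACACCC AGACCCTGAT CAACCTCGCC", is_cds_start=True)
tail = translate("GAGTCCTGA")
print(head, tail)  # MDTQTLINLA ES
```

The two results agree with the first ten and last two residues of the protein sequence listed above. For the full sequence, a complete codon table (or an established library) would be needed.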