Gene Noca_2920 details

Gene Information

Locus tag: Noca_2920
Symbol:
ID: 4597421
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 3101114
End bp: 3102130
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 72%
IMG OID: 639777525
Product: hypothetical protein
Protein accession: YP_924109
Protein GI: 119717144
COG category: [E] Amino acid transport and metabolism
    [G] Carbohydrate transport and metabolism
    [R] General function prediction only
COG ID: [COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily
TIGRFAM ID:
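
The coordinates and lengths listed above are mutually consistent, which a small sketch can confirm. It assumes 1-based, inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a complete CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3101114, 3102130)
print(length, protein_length(length))  # prints: 1017 338
```

Both values agree with the Gene Length and Protein Length fields of this record.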


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCGACC GGCAACAGCG GGCCACCTCG GCGAGCGGGG TGGAGCACCG GCCGCGGCCC 
GGTCGGATGG TGTGGATCAC CGCTTCTTGG GGTGCGTGCT TCGTGGCCAT CGAGTGGGGT
CTGCGGGACG CGCCCGTGCT GTGGTACGCC GCCCTGCGAG CCGTGCTCGC CGGTGCCGTC
CTGGTCGCCG TGGGAACCGC CCGGGGGCGT CCGACCCCGT CGTTGCCCCG GGACTGGGGC
TGGATCGTGG GGTTGGGGCT GATGAATGTC ACCGTCGCCT TCGCCGCCAT GTTCGCCGGG
GTGGCCGGGG GAACGACCGG CGCCGCCTCA GTGCTCGCCA ACGCCCAACC TCTGCTGATC
CTGCTGCCGG CATGGTGGCT CTATGGCGAG AGGCTGTCGG TCCTCACGAG CCTCGCGCTG
GTGGTCGGCT TCGCCGGCCT CGTCCTCGTT GCCGTACCCG GCGGAGGTGG CAGCGGCGCC
ATGCTCTCGC TGCTGTCCGC GGTGGCTGTC ACGGCCGGGA CCCTCATGTC GCGGCGCTTG
GCGAACGTCG ACGCGGTGCT TCTCACGGGT TGGCACCTTC TGATCGGCGG TGCTGCGCTG
GTGGGACTCG CCATGGCCGT GGAGGGAGCG CCGGCGATCG CGTGGACCCC CAGGTTCGTC
CTCTCGTTGC TCTTCCTAGC GTTGGTGGGC ACCGCAGGTA CGACGGTGGC GTGGTTCGTC
GAGGTCCGGC GCTCGCGGTT CGACCAGCTG ACCGCATGGA CGTTCCTGAC GCCTGTCGTC
GGGGTCGTGC TCGCGGTGGC GGTGCTCGGC GAGCGCCCCG CGGGGTGGAC GGGTGTCGGC
CTGGTCGTGG TCCTGATCGC CATGTGGGTC GTTCTGAGAC CAGCCGCCGC GCGGTTCGAC
GCGGGAGACG AGCCACCTGT CCGAGAGGGC GATCGGCGGC CTGGACCCCG GCAAGCGGTC
ACCCGAACGG CAGCGGCTCC GGTGCCAGCG AGACCGCCTT GGCGCGGGCC GCGGTGA
 
Protein sequence
MADRQQRATS ASGVEHRPRP GRMVWITASW GACFVAIEWG LRDAPVLWYA ALRAVLAGAV 
LVAVGTARGR PTPSLPRDWG WIVGLGLMNV TVAFAAMFAG VAGGTTGAAS VLANAQPLLI
LLPAWWLYGE RLSVLTSLAL VVGFAGLVLV AVPGGGGSGA MLSLLSAVAV TAGTLMSRRL
ANVDAVLLTG WHLLIGGAAL VGLAMAVEGA PAIAWTPRFV LSLLFLALVG TAGTTVAWFV
EVRRSRFDQL TAWTFLTPVV GVVLAVAVLG ERPAGWTGVG LVVVLIAMWV VLRPAAARFD
AGDEPPVREG DRRPGPRQAV TRTAAAPVPA RPPWRGPR
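
The gene and protein sequences above can be cross-checked with a short sketch that strips the block spacing, computes GC content, and translates the coding sequence. It assumes the sequence contains only A, C, G and T, and it relies on NCBI translation table 11 sharing its codon-to-amino-acid assignments with the standard code (the two differ in which start codons are permitted). The helper names `clean`, `gc_percent` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 shares); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGGCCGACC GGCAACAGCG GGCCACCTCG"
print(translate(first_blocks))  # prints: MADRQQRATS
```

The printed residues match the start of the protein sequence in this record. Passing the full gene sequence to `gc_percent` and `translate` allows the same comparison against the GC content field and the complete protein.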