Gene Noca_4622 details


Gene Information

Locus tag: Noca_4622
Symbol:
ID: 4596078
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 4899480
End bp: 4900589
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 75%
IMG OID: 639779231
Product: ribokinase-like domain-containing protein
Protein accession: YP_925804
Protein GI: 119718839
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0524] Sugar kinases, ribokinase family
TIGRFAM ID:
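
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive (the record does not state this), and that the final codon is an untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4899480
end_bp = 4900589
gene_length_bp = 1110
protein_length_aa = 369

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the last codon is assumed to be
# a stop codon, which contributes no residue to the protein.
codons = span // 3
residues = codons - 1

print(span, codons, residues)  # 1110 370 369
```

Under these assumptions the computed span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.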


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0428968
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCTCA CCGAGCGCGA GCAGGAGATC GTGACCCTGC TGCGGCGCGA TCCGCTCGTC 
AGCTCGGCGG CGATCGCCGA AGCGCTCGGG ACCACCCGCG CCGCCGTCAA CGTGCATGTC
TCGAACCTCA CCCGCAAGGG CATCGTCCTC GGCCGGGGCT ACGTGCTGAA CGAGGGCCCC
TCGGTGGTGG TCGTCGGCGG CGCCAACATG GACGTCAAGG CCCGCAGCAC CCGCGCCGCC
GTGGTCGCCA CCAGCAACCC CGGCACGGCC GCGATGGCCG CCGGCGGGGT CGGTCGCAAC
ATCGCCGAGA ACCTGGCCCG ACTCGGCACC CGGACCCACC TGGTCGCCGC GATCGGCAGC
GACGCGCTCG GGGACCAGGT GCTCGCCGCG ACCTCGAACG CAGGGGTGGT GGTGGAGCAC
GTACGCCGCA GCGCCCGGTC GACCGGCACC TACACCGCGG TCCTCGACGC CGACGGCGAG
CTGGTCGTCG CGGTCGCCGA CATGGCCGCC ACCGACGAGC TCCTGCCTGA CCAGGTCGCG
GCGGCGCGCG ACCTGGTGTC CGCCGCGTCG CTGGTCGTCC TCGACGGGAA CCTCTCGACC
GGCACGCTGC GCTACGCCCT CGACCTGGCC GCGGAGGTCG GCACCCGGGT GCTGCTGGAC
CCGGTCAGCG TCCCGAAGGC TGCCGCGCTC GCGCCGCTCG TCACCGTCGA CCGGCCGGTG
TTCACGGTGA CCCCCAACCG CGACGAGCTC GCGGCCCTGA CCGATCTCCC GACCCGGACC
CGGCGCCAGC AGGAGGCGGC GGCGCGGGCC CTGCACGACC GCGGCGTCCA GCTGGTCTGG
GTGCGGCTCG GCCCGGCCGG CTCGCTGCTC AGCTCACCGA CCGGCGTCGT CGCCCTGGAG
GCCGTCCCGG CGGGGGTGGC CGGGGAGGTC ACCGACGTGA CCGGCGCGGG CGATGCGATG
ACAGCGGCCT TCTGCCACGC CCTGCTGACC GGCTCCGACC CGGCCGAGGC CGCGGCGTAC
GGCCACGCCG CCGCCGCCCT CACCGTCGCC AGCACCGACA CCGTCCGAAC CGACCTCACC
GACCGACTCG TCAGGAGCCT GCTGTCATGA
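
The GC content field in the record can be recomputed from a sequence laid out like the one above. This is a minimal sketch, not the database's own method: the function name `gc_fraction` is hypothetical, and whitespace is stripped because the sequence is printed in groups of 10 bases.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(b in "GC" for b in bases) / len(bases)


# First line of the gene sequence above, kept in its 10-base grouping.
first_line = "ATGAACCTCA CCGAGCGCGA GCAGGAGATC GTGACCCTGC TGCGGCGCGA TCCGCTCGTC"
print(f"GC fraction of first line: {gc_fraction(first_line):.2f}")
```

Passing the full gene sequence instead of `first_line` gives a value that can be compared with the GC content listed in the record.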
 
Protein sequence
MNLTEREQEI VTLLRRDPLV SSAAIAEALG TTRAAVNVHV SNLTRKGIVL GRGYVLNEGP 
SVVVVGGANM DVKARSTRAA VVATSNPGTA AMAAGGVGRN IAENLARLGT RTHLVAAIGS
DALGDQVLAA TSNAGVVVEH VRRSARSTGT YTAVLDADGE LVVAVADMAA TDELLPDQVA
AARDLVSAAS LVVLDGNLST GTLRYALDLA AEVGTRVLLD PVSVPKAAAL APLVTVDRPV
FTVTPNRDEL AALTDLPTRT RRQQEAAARA LHDRGVQLVW VRLGPAGSLL SSPTGVVALE
AVPAGVAGEV TDVTGAGDAM TAAFCHALLT GSDPAEAAAY GHAAAALTVA STDTVRTDLT
DRLVRSLLS
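
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last 30 bases of the gene. It assumes the standard codon assignments, which translation table 11 shares for elongation codons (table 11 differs in the alternative start codons it permits, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

# Codon table built in the conventional TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: translation ends, no residue added
            break
        protein.append(aa)
    return "".join(protein)


first_30 = "ATGAACCTCA CCGAGCGCGA GCAGGAGATC"
last_30 = "GACCGACTCG TCAGGAGCCT GCTGTCATGA"
print(translate(first_30))  # MNLTEREQEI
print(translate(last_30))   # DRLVRSLLS
```

Both outputs match the start and end of the protein sequence listed above. The final TGA codon is a stop codon and produces no residue.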