Gene Noca_4184 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4184 
Symbol 
ID4596698 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4422049 
End bp4423881 
Gene Length1833 bp 
Protein Length610 aa 
Translation table11 
GC content71% 
IMG OID639778790 
Productpolysaccharide biosynthesis protein CapD 
Protein accessionYP_925368 
Protein GI119718403 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCGACT TGGTCGCGCG TATGAGGCGG CGCCGCGCGC CGCTCGCCAT CGCGTTCGAC 
GTGTGTGCCT GGTTGCTCGG CTACGCGGCG TTCGCCTGGC TTCGGCTCGA CAGTGACGCC
TCGAGCGTGC CGTGGCCGGA GGTGCTCGCC TTCGCGACCG TGACTGCGAC GCTGTACGTG
GCACTGGCGG CGCCCCTCCG GCTCCACCAG GGCCGCGCCC GCACCGCGAG CCTGGAGGAG
ATGGTGCTCC TCGGGATGCT GATGGGCGGC GTCGGTGGCG GCGTCTTCCT GGTGAACCTG
TTCGGCCAGT GGATCCCCCG CAGCATCCCG GCCGGGGCGA CCTGCGGTGC GCTCGTGCTC
GCCGCGTGGG CGCGGGCGAG CTGGCGGACC CTGCAGGAGC GCGACGAGCG CACCGGCGCC
GGTGAGGACA CCGAGCGGAC GCTGGTGATG GGCGCGGGTG AGGCCGGCCG CGAGCTGATC
ACCTCGATGC AGCGCGACCC GCTGCGGCGC TACCTGCCGG TCGGACTGCT CGACGACGAT
CCCTACAAGC GGCACCGCCG GCTGCGCGGC GTACCCGTCC TCGGGACCAG CTGCGACCTG
GAGAAGGAGG TCGCGCGCAC CGGGGCGACC ATGGTCGTGA TCGCGATCCC CAGCGCCAGT
GCCGAGACGG TCAACCGGCT CCGGCTCACG GCTCTGGACG CCCATGTGTC GGTCAAGGTG
TTGCCGTCCA CCACGCAGCT CCTCACCGAC AGCGTCGGCA TCCGCGACCT GCGGGACATC
AACATCACCG ACGTACTCGG CCGCAACCAG CTGGACACGG ACGTCGCCTC GATCGCCGGC
TACCTGGCCG GGCGGAAGGT GCTGGTCACC GGTGCCGGCG GCTCCATCGG CTCCGAGCTG
TGCCGCCAGA TCTACCGCTA CCAGCCGGCC GAGCTGATGA TGCTCGACCG TGACGAGTCC
GCGCTCCACT CGGTCCAGCT CTCGATCCAC GGCCGTGCCC TGCTGGACTC GGACGACGTC
ATCCTGTGCG ACATCCGGGA CGAGAAGGCG GTGCGAACCA TCTTCGCGAA CCGTCGCCCC
GACGTCGTCT TCCACGCCGC CGCGCTCAAG CACCTGCCGA TGCTCGAGCA GTACCCGGCC
GAGGCCGTGA AGACCAACGT GATCGGCACC CGCACGGTTC TCGATGCCGC AGACCTCGTC
GGTGTCGACA GGTTCGTGAA CATCTCCACG GACAAGGCGG CGAACCCGTC CAGCGTCCTG
GGCTACTCCA AGCGGGTCGC CGAGCGGATC ACAGCCGCCC AGGCGCGCGA GGCTTCCGGG
ACGTATCTCT CGGTGCGCTT CGGGAACGTG CTCGGCAGCC GTGGCTCGGT GCTGGCGGCG
TTCGCCCGGC AGATCGCCGC CGGTGGGCCG ATCACGGTCA CCCACCCGGA CGTCAGCCGC
TTCTTCATGA CGATCGAGGA GGCCTGCCAG CTGGTCATCC AGGCGGCTGC GATCGGCGGG
CCGGGGGAGG CGCTCGTCCT CGACATGGGC GAGCCGGTGA AGATCGTGGA CGTCGCCGAG
CAGCTGATCG AGCAGGCCGG CACGCCGGTG CCGATCGAAT ACACCGGGCT GCGCGAGGGC
GAGAAGTTGC ACGAGGAGCT CTTCGGCGAA GGCGAGCCGT GCGACGTCCG GCCGCGGCAC
CCGCTGGTCT CGCACGTGCC GGTGCCACCG ATCACCGACG GCGAGGTCCT GGGCCTCACC
CTCGTCGGCG AGCCTGACGA CGTGCGGCAG GCGCTGCACG ACGCGTGCCT GGTGTCGATC
GAGGCCGACG ACCCGTCCTC GCTGCGGAAC TGA
 
Protein sequence
MSDLVARMRR RRAPLAIAFD VCAWLLGYAA FAWLRLDSDA SSVPWPEVLA FATVTATLYV 
ALAAPLRLHQ GRARTASLEE MVLLGMLMGG VGGGVFLVNL FGQWIPRSIP AGATCGALVL
AAWARASWRT LQERDERTGA GEDTERTLVM GAGEAGRELI TSMQRDPLRR YLPVGLLDDD
PYKRHRRLRG VPVLGTSCDL EKEVARTGAT MVVIAIPSAS AETVNRLRLT ALDAHVSVKV
LPSTTQLLTD SVGIRDLRDI NITDVLGRNQ LDTDVASIAG YLAGRKVLVT GAGGSIGSEL
CRQIYRYQPA ELMMLDRDES ALHSVQLSIH GRALLDSDDV ILCDIRDEKA VRTIFANRRP
DVVFHAAALK HLPMLEQYPA EAVKTNVIGT RTVLDAADLV GVDRFVNIST DKAANPSSVL
GYSKRVAERI TAAQAREASG TYLSVRFGNV LGSRGSVLAA FARQIAAGGP ITVTHPDVSR
FFMTIEEACQ LVIQAAAIGG PGEALVLDMG EPVKIVDVAE QLIEQAGTPV PIEYTGLREG
EKLHEELFGE GEPCDVRPRH PLVSHVPVPP ITDGEVLGLT LVGEPDDVRQ ALHDACLVSI
EADDPSSLRN