Gene Noca_1563 details


Gene Information

Locus tag: Noca_1563
Symbol:
ID: 4595503
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 1657333
End bp: 1659564
Gene Length: 2232 bp
Protein Length: 743 aa
Translation table: 11
GC content: 71%
IMG OID: 639776162
Product: glycosyl transferase, group 1
Protein accession: YP_922764
Protein GI: 119715799
COG category: [M] Cell wall/membrane/envelope biogenesis
              [S] Function unknown
COG ID: [COG0392] Predicted integral membrane protein
        [COG0438] Glycosyltransferase
TIGRFAM ID: [TIGR00374] conserved hypothetical protein
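
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
start_bp, end_bp = 1657333, 1659564
length = gene_length(start_bp, end_bp)
print(length, protein_length(length))  # 2232 743
```

Both results agree with the Gene Length (2232 bp) and Protein Length (743 aa) listed in the record.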


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTGGGA CCTACGCGGC CGAACACTCG CCCGACGCGC TCGCCGGCGC CGCGCTTTCC 
CCCGCGCGCC GTTCACAGAT GTGGGGGCGC CTGCGGTTCC TCGGCTCCTG GATTCTCGCG
CTCATCCTGG TTGCGGTTGC CCTGCCTCGG ACCGTCGACG TCTCCTGGCA CGGCCTGCTT
CCGGCGTTGC GGGCCGTGCA CTGGCCGGCC CTGCTGGCCC TTGTGGCCTT GTGGCTCCTT
GGGCTCTTCG TCCACTCCTT CGTCCTGACT GCGGCAGCAC CCGCCCTGAC CCACCGCCGT
GCCCTGACCC TCAACGTGAC CGGCAGCGCC GTCGCCAACG TGGTCCCCCT GGGCGGGGCC
GCAGGTGTCG AGCTCAACCG CCGGATGATG AAGGCCTGGG GCATCGACAC CCGTGCCTTC
GCCGGCTACA CCTTTCTCAC CAACCTGTGG GACGTCGCGT CCAAGCTGCT CCTGCCCATG
ATTGGCGTCA TCGCACTCGT CCACGCAGGC GAGACGATCA CGCCCCAACT CAAGACCGCC
TCAGTGCTCG CCGGCATCGC CTTCCTCGGG CTTGCCGCTT TCGCAACGGT GCTCCTTCTC
AGCCCGAGAG GAACCGCTCA ACTCGGACAC GGGATCGAGC AGACGCTGCG GTTCGGGCTG
CGGCTGATCG GCCGCGACCG TGCACTGGAT CTGGCCGGCC GGCTCAACGA CGTCCGCTCC
GAATGCGCCG GCCTGGTCGC CTCCGGATGG GTTCGAATGA CGGCCGGCAT CACCGGGTAC
GTCGCCCTGC AATGGCTCCT CCTCGGGTTC TGCCTGCAGC TGACCGGCGC CGGAACCACC
TGGCCCGAAG TGCTCGCCGG CTTCGCCGTC GAACGGCTCT TCACGATCGT CCCCCTCACC
CCAGGCGGAG TCGGCGTCGC GGACCTCGGC CTCGTCGGCG TCCTGCTGAC CCTCGGCGGT
GACCCGGCCG GCGTCACCGC CGCGGCCGTG CTCTATCGCC TGTTCGTCTT CGCCGTCGAG
ATCCCCGTCG GCGGCGGCGT CCTCGGAATC TGGTTCCTCG CCCAGCGCCG AACTCCCGCG
CGTCCGCAGG ACGCGGTGCG GTCGCTCGGC CCGACCCGCC GGATCGCGCA CGTCACCGAC
GTCTTCCTGC CGCGGCTCGG CGGGATCGAG ACCCACGTCG ACGACCTCGT CCGACACCAG
CGGGCAACCG GGCTCGACGC CCACGTCCTA ACCCCCACCC CTGGCAACGG CCCCGACCCG
GCATGGGTGC GCCGGATGCC CGCCGTCGTC GCGCGCGGCA CCGTCACCGA GTACGACACC
ATCCACGTCC ACATCTCCAT GTGGTCGCCC TACGGGATCG CGGTCGCTCG CGCCGCGATG
GCCGCCGGGA TGCCCACCCT CATCACCGTG CACTCGATGT GGGCCGGCGC CGGCGGACTC
CTCCGGCTCA TCGCGCTCGC CGGCCTCCGA CGCTGGCCCG TCGCCTGGTC CGCGGTCAGC
GGAGCCGCCG CCGAAGCATT CCGACGCTCG CTGAACGGAG CCGAGGTAGC GGTGCTACCC
AACGCGATCG ATATCGACTC CTGGCGACCG CGTCCGGCCC CCATCGCTCG GCGCGAGCCG
CAGGCCGCTG AGGGGCCGCT CACCGTGGTG AGCGTGATGC GCCTGATGCC GCGCAAACGA
CCCCTCCAAC TGCTCGACAT GTTCGAGCGG ATCCGCGCAC TCACTCCGAA CGACGATGTT
CGGCTGGTCA TCGTCGGTGA CGGCCCGCTC CACAGACGGC TGCAACGCGC GGTGCGCCGA
CGCGGCCTCG ACGAGCGCGT GCGGATCACC GGGCGCATCC CTCGACACCA GGTGCTCGAG
GAGCTACAGG CCGCCTCGCT GTACGTCGCA CCCGCCCCCA AGGAATCCTT CGGAATCGCC
GCACTCGAAG CACGCTGCGC CGGCCTGCCC GTCGTCGCAC ACCGCCGCAG CGGCGTCGGC
GAGTTCGTCC GCGACCGCGT CGACGGCATC CTGGTCGCCA ACGACACCGA GATGGTCGTC
GCGATCGCCG ACCTGGTGCG CGACCCGAGC CTCCGCAACC GGTTCACCGC GCACAACCGA
CGCGTCGCCC CGAAGCTCGA CTGGAGCGAC ATTCTCCGGC AAACCGATGC GCTCTACCTG
AAGGCCGCCA GCCGCGCCGT TCTCTTGACC CAAGAAATTG CCGCGCCGGT GCCAGTGATG
GCGGAAGCCT GA
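
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be normalised before any length or GC calculation. The following is a rough sketch of one way to do that. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input. Running the same functions over the full block should give figures comparable to the Gene Length and GC content fields.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used here only as sample input.
first_line = "ATGACTGGGA CCTACGCGGC CGAACACTCG CCCGACGCGC TCGCCGGCGC CGCGCTTTCC"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.0%}")
```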
 
Protein sequence
MTGTYAAEHS PDALAGAALS PARRSQMWGR LRFLGSWILA LILVAVALPR TVDVSWHGLL 
PALRAVHWPA LLALVALWLL GLFVHSFVLT AAAPALTHRR ALTLNVTGSA VANVVPLGGA
AGVELNRRMM KAWGIDTRAF AGYTFLTNLW DVASKLLLPM IGVIALVHAG ETITPQLKTA
SVLAGIAFLG LAAFATVLLL SPRGTAQLGH GIEQTLRFGL RLIGRDRALD LAGRLNDVRS
ECAGLVASGW VRMTAGITGY VALQWLLLGF CLQLTGAGTT WPEVLAGFAV ERLFTIVPLT
PGGVGVADLG LVGVLLTLGG DPAGVTAAAV LYRLFVFAVE IPVGGGVLGI WFLAQRRTPA
RPQDAVRSLG PTRRIAHVTD VFLPRLGGIE THVDDLVRHQ RATGLDAHVL TPTPGNGPDP
AWVRRMPAVV ARGTVTEYDT IHVHISMWSP YGIAVARAAM AAGMPTLITV HSMWAGAGGL
LRLIALAGLR RWPVAWSAVS GAAAEAFRRS LNGAEVAVLP NAIDIDSWRP RPAPIARREP
QAAEGPLTVV SVMRLMPRKR PLQLLDMFER IRALTPNDDV RLVIVGDGPL HRRLQRAVRR
RGLDERVRIT GRIPRHQVLE ELQAASLYVA PAPKESFGIA ALEARCAGLP VVAHRRSGVG
EFVRDRVDGI LVANDTEMVV AIADLVRDPS LRNRFTAHNR RVAPKLDWSD ILRQTDALYL
KAASRAVLLT QEIAAPVPVM AEA
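
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes that translation table 11 (bacterial, archaeal and plant plastid code) assigns the same amino acids to codons as the standard code and differs only in the set of permitted start codons. That difference does not matter here because the gene starts with ATG. The codon table is built from the conventional TCAG ordering, and the `translate` helper is illustrative, not a library function.

```python
from itertools import product

# Codon assignments in TCAG order; assumed identical for the standard code
# and translation table 11 (the tables differ in allowed start codons only).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a DNA string codon by codon, optionally stopping at '*'."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGACTGGGA CCTACGCGGC CGAACACTCG"))  # MTGTYAAEHS
print(translate("GCGGAAGCCT GA"))                      # AEA
```

The two outputs match the first ten and last three residues of the protein sequence listed above.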