Gene Noca_4630 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4630 
Symbol 
ID4596086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4909269 
End bp4911503 
Gene Length2235 bp 
Protein Length744 aa 
Translation table11 
GC content73% 
IMG OID639779239 
Productglycoside hydrolase family protein 
Protein accessionYP_925812 
Protein GI119718847 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACCCGG CGGACCGCCC TCCGGTTACC CACCGTCCAC ACGGCATCGA GCACCCCTAT 
GTGCAGTCCG CCGACCAGCG GTGCCCGGTG CTGCCCCTGG CCGGCAGCAC CTGCCGGATC
GGAGTCACCG CCGACCCGGC CGTGGTCGCG GTCCGCTGCG AGTGGGGGGA CCAGGTGCTG
GCGATGACCC CACGCACCGT CGATGCCGCC GACGCGGCCG CCCTCGCCGG CGGCGAGGGC
CACCTGGCCG AGGCCCAAGC GAACATGCTC GAGGCCGACG GGGCGTGGAC CCTGGAGACC
CCCGTGCTGG TGGAGGGCTG TCGCCTCACC TATCGGTTCG TGGCCGTGAC CGCCGACGGC
GTGGAGGTCG CGACGCCGGC GTACCACGTC GCACCGGCGG TCTGGTCGGC GAGCCACCCG
GGTCGTCTCC AGGGGGCGGC CGACCGGCTC GTGCCCGGGA GCACGTCGTG GCTCGTCAGC
CCCGACGGCG TCCACCGGGT GCGGTTCGCG CTGGCCCTGC GGCCCGGCGA CCACGTCGTC
GGGTTCGGGG AGCGCTTCGA CCGGCTGGAC CAGGCCGGCC AGCGCCTCGA CGCCGTGGTG
TTCGAGCAGT ACAAGTCGCA GGGTCGGCAC GGGCGGACGT ACCTCCCGAT GCCGTTCGCC
CACGTGGTCG GTGCCGACGG CGAATCGTGG GGCTTCCATG TCCGCACCTC GCGGCGCACC
TGGTACGACG TCGGCGCCAG CACCCCGGAA GCGCTGGTGG TGGAGGTCGA TCTGGGTACG
GGCGGGGACA TCGTGGACGT CGGCATCTAC GACGGCACGC CCACCGAGGT GCTGACAGAG
TTCCTCGACG AGGTCGGGCG CGCCGAGGAG CTCCCGGAGT GGGTGCTGCG GCTGTGGGCG
TCCGGCAACG AGTGGAACAC CCAGCGCCTG GTGATGGATC GGATGGACCG CCACCGCGAG
CTCGACATCC CCGTCGGGGT CGTCGTGATC GAGGCGTGGA GCGACGAGGA GGGCATCACG
ATCTTCCGGG ACGCCCGCTA CGAGCCGAAC CCCGACGGCC GGGCCCATGA CGCAGCCGAC
TTCGCCTACC CGCCGGACGG CGCCTGGCCG GACCCGCACG GCATGGTCGA CGACCTGCAC
TCCCGGGGCA TCAAGGTGGT GCTCTGGCAG ATCCCACTCC AGAAGACCGA CCCCGAGCTG
ACCGGCCAGG TGCGGATGGA CGCCGAGGCG ATGGTGCGCG AGGGGCACGC AGTGCTCGAG
GCCGACGGAT CGGCGTACCA CAACCGCGGA TGGTGGTTCC CGCGGGCGCT GATGCCTGAC
CTCTCGGTGC AACGCACCCG CGACTGGTGG ACGGCCAAGC GCCGCTACCT GCTCTCCGAC
CTCGACGTCG ACGGCTTCAA GACCGACGGC GGCGAGCACG CCTGGGGGCA CGACCTGCGC
TACGCCGACG GACGACGTGG CGACGAGGGC AACAACCTCT TCCCCGTCCA CTACGCGCGC
GCCTTCGGTG ACCTGCTGCG GTCCGAGGGC AAGGCGCCGG TGACGTTCTC GCGCGCCGGC
TTCACGGGGT CGCAGGCGCA CGGGGTCTTC TGGGCGGGTG ACGAGGACTC GACCTGGGAG
GCCTTCCGGA GCTCGGTGAC CGCCGGTCTG ACGGCCGCAG CCTGCGGCAT CATCTACTGG
GGCTGGGACC TGGCGGGCTT CTCGGGCCCG GTGCCGGACG CCGAGCTCTA CCTGCGGGCC
GCCGGCGCCT CGGTGTTCAT GCCGGTCATG CAGTACCACT CCGAGTTCAA CCACCACCGC
CCGCCGTTGC GCGACCGGAC GCCGTGGAAC GTGCAAGAGG CGAGCGGCGA CGAGCGGGTG
GTGCCGGTGT TCCGGCACTT CGCGCGGATG CGTGAGCGGC TGGTCCCCTA CCTCGCCGAG
CAGGCACGAG CGACGGTCGC CACGGACCGG CCGCTGATGC GGCCGCTGTT CTTCGACCAC
CCTGCCGACC CGGCGCTCTG GGCGCACCCC CTCCAGTGGA AGCTGGGCGA CGGCATGCTG
GTCAACCCGG TCACCGAGCC CGGCGCGACC GCGTGGTCGA CGTACCTGCC CGCAGGCCAG
TGGGTCGACG CGTGGACCGG GACGGCGTAC GCCGGCGAGC AGGTCGTCAC CGCCGAGGAG
CTGCCCCTCG ACCGGGTCCC CGTCTACCTC GCCGCCGACG CCGCGCCCGG CCTCGCCGCG
GTGTTCTCCG AATGA
 
Protein sequence
MNPADRPPVT HRPHGIEHPY VQSADQRCPV LPLAGSTCRI GVTADPAVVA VRCEWGDQVL 
AMTPRTVDAA DAAALAGGEG HLAEAQANML EADGAWTLET PVLVEGCRLT YRFVAVTADG
VEVATPAYHV APAVWSASHP GRLQGAADRL VPGSTSWLVS PDGVHRVRFA LALRPGDHVV
GFGERFDRLD QAGQRLDAVV FEQYKSQGRH GRTYLPMPFA HVVGADGESW GFHVRTSRRT
WYDVGASTPE ALVVEVDLGT GGDIVDVGIY DGTPTEVLTE FLDEVGRAEE LPEWVLRLWA
SGNEWNTQRL VMDRMDRHRE LDIPVGVVVI EAWSDEEGIT IFRDARYEPN PDGRAHDAAD
FAYPPDGAWP DPHGMVDDLH SRGIKVVLWQ IPLQKTDPEL TGQVRMDAEA MVREGHAVLE
ADGSAYHNRG WWFPRALMPD LSVQRTRDWW TAKRRYLLSD LDVDGFKTDG GEHAWGHDLR
YADGRRGDEG NNLFPVHYAR AFGDLLRSEG KAPVTFSRAG FTGSQAHGVF WAGDEDSTWE
AFRSSVTAGL TAAACGIIYW GWDLAGFSGP VPDAELYLRA AGASVFMPVM QYHSEFNHHR
PPLRDRTPWN VQEASGDERV VPVFRHFARM RERLVPYLAE QARATVATDR PLMRPLFFDH
PADPALWAHP LQWKLGDGML VNPVTEPGAT AWSTYLPAGQ WVDAWTGTAY AGEQVVTAEE
LPLDRVPVYL AADAAPGLAA VFSE