Gene Noca_4771 details


Gene Information

Locus tag: Noca_4771
Symbol:
ID: 4595373
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008697
Strand:
Start bp: 77241
End bp: 80102
Gene Length: 2862 bp
Protein Length: 953 aa
Translation table: 11
GC content: 70%
IMG OID: 639772560
Product: helicase domain-containing protein
Protein accession: YP_919220
Protein GI: 119714078
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG0553] Superfamily II DNA/RNA helicases, SNF2 family
TIGRFAM ID:
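
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 77241
end_bp = 80102
gene_length_bp = 2862
protein_length_aa = 953

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = span // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # True
print(span % 3 == 0)                                 # True
print(expected_protein_length == protein_length_aa)  # True
```

Under these assumptions the span is 2862 bp, which is 954 codons, or 953 residues once the stop codon is excluded.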


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.360251
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTCCC CTGCCCCCCT GGACGTCGCC GTCGGCAGCC TGGTCCGCGC CCGAGGTCGC 
GAGTGGGTCG TCCTGCCCGG CACCACTCCC GACTTCGTAC TCCTTCAGCC CCTCGGCGGC
GGCAGCGACG ACGTCGTCGG AGTCTTTCCC GACGAGGGAG TCGAACACGC CACCTTTCCC
CCGCCCACCG CCGCCGACCT CGGTGACAGC GCTTCCACTG CCCTGCTGCG CACCGCCCTG
CGCGTCGGCT TCACCGCCGG AGCCGGCCCC TTCCGCTCCC TGGCGGGCAT CAGCGTCAGC
CCGCGCAGCT ACCAGTACGT CCCCCTGCTC ATGGCCCTGA GACAGGAGAC CGTCCGGCTC
CTGATCGCCG ACGACGTCGG CATCGGCAAA ACCATCGAGG CCGGCCTGAT CGCCGCCGAA
CTGCTCGCCC AAGGCACCGT CCGGCGCCTC GCCGTCCTGT GCTCACCGGC ACTCGCCGAG
CAGTGGCAGC GCGAGCTCGC CACCAAGTTC GGCATCGACG CCGCGCTCGT GCTCACCAGC
ACCGTAAAGC GCCTCGAGCG CGGCCTGATG CTCAACGAGT CGCTGTTCGA CCGCTACCCG
CACGTCATCG TCTCCACCGA CTTCATCAAA TCCGCCCGTC GCCGCCACGA CTTCCTTCTC
CGCTGCCCCG AGCTGGTCAT CGTCGACGAA GCCCACAACT GCGTCGCCGG CGCTGGCGTC
GGCCATCGGT CCCGACACCA GCGCTACGAG CTGCTGCGTG ACCTGGCCGC CGATGCGACC
CGCCACCTCG TGATGGCCAC CGCAACCCCG CACTCCGGCG ACGAGGCCGC GTTCACCAAC
CTGGTCGGCC TCTGCAACCC CGAGCTCGCC ACGGCCGACC TCAGCTCCGA GAAGGGCCGC
CGCCTTCTCG CCCAGCACTT CGTGCAACGC CGCCGCGCGG ACATCCGCCA CTACCTCGAC
GAGGACACCC CATTCCCCAA GGACCGCCTG ACCCTCGACG TTCCCTACAC GCTCTCGCCG
GCCTACCGCG ACCTGTTCGA CGATGTTCTC GCCTACGCCC GCGAGCAGGT CCGCGACGGC
GTCGACGGCA CGCGTGGACG CGTCCGATGG TGGTCTGTCC TCGCACTGCT CCGAGCGCTC
GCCTCCAGTC CCCAGGCGGC AGCCGCGACT CTGCGAACGC GCGCAGCCAA CGCCGAGGCC
ACCACCGCTC AGGAGGCCGA TGAACTTGGC CGGGCTGCCG TCCTCGACAG CGCTGACGAC
GAGTCACTCG AAGGGATCGA CACCACGCCC GGCGCACTCA CCGACGACGA GACCAGCGCC
GGTACCGACA CTGCCGAGCG GCGGCGCCTG CTCGGATTCG CTCGGCGGGC CGCCGACCTC
GCCGGGCCGG ACCACGACCG CAAGCTCGCC GCCGTCACCA CCCAGGTCAA GAAACTCCTA
GCGGACGGCT ACAGCCCGAT CGTGTTCTGC CGCTTCATCG ACACCGCCCA CTACGTGGCG
GCTCACCTCA ACACGGCACT CGGCACCAAG AGAAACCCCG TGCACGTCGT CAGCGTCACC
GGCGAGCTCC CACCCGCCGA GAGGGAACGC CGCGTGGGCG AGCTCACCGA ACTCGACGGC
GAACACGTCC TCGTCGCCAC CGACTGCCTC TCCGAAGGTG TCAACCTGCA GGAGCACTTC
TCCGCGGTCG TGCACTACGA CCTGTGCTGG AACCCCACCC GCCACGAGCA GCGCGAAGGA
CGGGTCGACC GCTACCTTCA GCGGAAGGAG GTCGTCCGCG CCGTCACCCT TTACGGCGAG
GACAACGGCA TCGACGGCAT CGTCCTCGAC GTCCTCATCC GCCGCCACCG TGCGATCGCC
AAGGCCACCG GCGTGGCGGT TCCCGTCCCC GGCGACGGCC AAGGCCTCAT CGACGCCCTC
GCCGAGGGCC TGCTGCTACG CCGCCAGGAC TCCCGCGATC AGCTCGTCCT GGACCTCGGA
CTGGACGAGC GCACCCAACA GCTCGAGGAC GCCTGGACCT CTGCGGCCGA GCAGGAGAAG
GTCTCCCGCA CCCGCTACGC CCAGCACGCG ATCAAGCCTG AGGAAGTCGC GGCCGAGGTC
GAGGAGCTCC ACGCCGCGCT GGGAACCCAT GGCGACCTCG AGACCTTCCT CACCGGCACC
CTGCGCGCTC TCGGCTCCAC CCTGACCACC GGGACCGACG CCTTCACCGC CGTCACCGCC
ACCCTGCCCC TCGGCCTGCG CACCGCCCTG CCGGTCGGGA TCCGCGACCC GCTGCCGTTC
CACTCAGAGC CGCCCGCAGG CCGCGGTGAG GCGGTCATCG CCCGCACCGA CCCCACCGTC
CAGGCAATCG CCCGCTACGT GCTCGAGAGC GCACTCGACC CCACCGTGCC AGCCCCGGAG
CGGCCAGCGC GGCGGGCCGG CGTCATGCGC ACCCGCGCCG TGCAGACGCG CACCACGCTC
CTGCTCGTCC GCTTCCGGTT CCAGCTGGAA CTTCCCGCAT CCGACGGGGT CCAGCAACGG
GTCGCGGAGG ACGCCCGAGT CCTTGCCTTC GAGGGCACGC CATCACAAGC CCTCTGGCTC
CCCGACGACC GGGCCGAGGA CCTGCTGTCA GCAACTCCGA CGGCCAACGT CATCGAGGGG
GCCGCCCACG ACGCGATCAG TAAGGTTCTC GACGGACTCT CGGCGCTCAC CCCGCACCTG
GAGTCAGTGG CCGACGAACA TGCGGACAGG CTGCTCGGCG CCCACCGCCG TGCCCGTACC
GGCGCCGGCG CTGCCCGCCG AGGCCTTGCT GTTACTGCAC AACGCCCCGT CGATGTGCTC
AGTGTCCAGC TCTTCTTGCC CGATCTCGGA GGTGCCTCGT GA
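
The gene sequence above is printed in blocks of ten bases, so it needs to be joined before its length or GC content can be computed. The sketch below shows one way to do that; the function names `clean_sequence` and `gc_fraction` are hypothetical helpers, and the demo input is only the first and last lines of the sequence, so the GC value it prints is not the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group bases into blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence above, used only as a short demo.
first_line = "ATGACCTCCC CTGCCCCCCT GGACGTCGCC GTCGGCAGCC TGGTCCGCGC CCGAGGTCGC"
last_line = "AGTGTCCAGC TCTTCTTGCC CGATCTCGGA GGTGCCTCGT GA"

demo = clean_sequence(first_line + "\n" + last_line)
print(len(demo))                                     # 102
print(demo.startswith("ATG"), demo.endswith("TGA"))  # True True
print(round(gc_fraction(demo), 2))
```

Passing the full sequence text to `clean_sequence` and then to `gc_fraction` would give a value that can be compared with the GC content field in the record.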
 
Protein sequence
MTSPAPLDVA VGSLVRARGR EWVVLPGTTP DFVLLQPLGG GSDDVVGVFP DEGVEHATFP 
PPTAADLGDS ASTALLRTAL RVGFTAGAGP FRSLAGISVS PRSYQYVPLL MALRQETVRL
LIADDVGIGK TIEAGLIAAE LLAQGTVRRL AVLCSPALAE QWQRELATKF GIDAALVLTS
TVKRLERGLM LNESLFDRYP HVIVSTDFIK SARRRHDFLL RCPELVIVDE AHNCVAGAGV
GHRSRHQRYE LLRDLAADAT RHLVMATATP HSGDEAAFTN LVGLCNPELA TADLSSEKGR
RLLAQHFVQR RRADIRHYLD EDTPFPKDRL TLDVPYTLSP AYRDLFDDVL AYAREQVRDG
VDGTRGRVRW WSVLALLRAL ASSPQAAAAT LRTRAANAEA TTAQEADELG RAAVLDSADD
ESLEGIDTTP GALTDDETSA GTDTAERRRL LGFARRAADL AGPDHDRKLA AVTTQVKKLL
ADGYSPIVFC RFIDTAHYVA AHLNTALGTK RNPVHVVSVT GELPPAERER RVGELTELDG
EHVLVATDCL SEGVNLQEHF SAVVHYDLCW NPTRHEQREG RVDRYLQRKE VVRAVTLYGE
DNGIDGIVLD VLIRRHRAIA KATGVAVPVP GDGQGLIDAL AEGLLLRRQD SRDQLVLDLG
LDERTQQLED AWTSAAEQEK VSRTRYAQHA IKPEEVAAEV EELHAALGTH GDLETFLTGT
LRALGSTLTT GTDAFTAVTA TLPLGLRTAL PVGIRDPLPF HSEPPAGRGE AVIARTDPTV
QAIARYVLES ALDPTVPAPE RPARRAGVMR TRAVQTRTTL LLVRFRFQLE LPASDGVQQR
VAEDARVLAF EGTPSQALWL PDDRAEDLLS ATPTANVIEG AAHDAISKVL DGLSALTPHL
ESVADEHADR LLGAHRRART GAGAARRGLA VTAQRPVDVL SVQLFLPDLG GAS
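
The protein sequence above corresponds to the gene sequence translated with translation table 11. As a minimal sketch, the codon table can be built from the usual compact amino-acid string; table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, which this sketch does not model. The names `CODON_TABLE` and `translate` are illustrative, not taken from the source database.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGACCTCCC CTGCCCCCCT GGACGTCGCC GTCGGCAGCC TGGTCCGCGC CCGAGGTCGC"
print(translate(first_line))  # MTSPAPLDVAVGSLVRARGR
```

The printed peptide matches the first twenty residues of the protein sequence; translating the complete gene sequence the same way can be used to check it against the full 953-residue protein.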