Gene Noca_4970 details

Gene Information

Locus tag: Noca_4970
Symbol:
ID: 4595343
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008697
Strand:
Start bp: 303064
End bp: 305532
Gene Length: 2469 bp
Protein Length: 822 aa
Translation table: 11
GC content: 69%
IMG OID: 639772752
Product: hypothetical protein
Protein accession: YP_919412
Protein GI: 119714270
COG category:
COG ID:
TIGRFAM ID:
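
The Start bp, End bp, Gene Length and Protein Length values above can be cross-checked with a little arithmetic. The following is a minimal Python sketch using only numbers from this record. The variable names are illustrative, and it assumes the coordinates are 1-based and inclusive, which is consistent with the stated gene length.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 303064
end_bp = 305532
gene_length_bp = 2469
protein_length_aa = 822

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span = end_bp - start_bp + 1

# A CDS is a whole number of codons; the final codon is the stop
# codon and does not contribute an amino acid.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span, codons, expected_protein_length)  # 2469 823 822
```

The 823 codons include the terminal stop codon, which accounts for the 822 aa protein length.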


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00792904
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGTACAAGA TTGAGCAGCG CGGGCGACGC GTCGTGCTGC CTGCGACCGA TGACCCGGGC 
GTGGTGCTGG AGTTCTCGAT GTGGGATGCG GAGAAGACGC TCAAGTCGTT CATCCGTGTC
TCCGGCGGAG AGCGCCGCAG GCACCTGGCC AGAGTCAAGG TCGGCTTAGT CCACCGCTCC
GACAACACCT ACAACCCGGG CGCGGTCGCA GTCGTCATCC CTGCCGACCA GGGCGGCAGC
GTCGAGGAGC GGCACCTGGG CTTCCTCTAC GACGCCGACC TGCGCGAGAT CGGGTCGAGC
AAGCTCCCGG CCCTGATCCG GTACGCCGGC GGGGAGGTCG AGTGCGACGC CGTCATCGCT
GACCGGGCGA CTCTCGGCCT GGACCTGCCC GACCTCGCCG TACTCGGGAA CGCCATCGGC
AAGTTCCTCC GCAGCAGTGG CGCCGACCCT GGGGCATCGG TGATGCCCGA ACCGGTTCCT
GAACGTATCG GGGACAGGTT CCGTACCCGA ACCATTCTCG GTACCGACGA AACCCTCGAC
CTGCTGCGCG CCTCCCCCGA ACCGGGAGGC GCCATCGCTG GGGTGGACAT CTCGATGGCC
TATCCGTTCG AGAGCGACTC GTGGACGCTG AACCTGCACG AGTCGGCCAC CGGCCGGCAC
CTGGGCACAG TCCAGGACAC CACGTTGTTC CTCGGCAACG AGCGCGACCG TGAGGCCGTA
CTTCCCCACC TGGCTGCCCA GGAGGTCCCG GCCCGGCAAC CGAAGCCGGC GCCGCGGGCG
ACCGTTCCCG GGTCGTGGCC CACCGGTCAG GTACCGAACG TTCGACCGAA GTGGGACCGT
GGAGCGATCG CGCTGCACGC AGCAGGCGCT CCCGAGAGTC ACTCCATGGA ATCGTTCGCG
ATCTTCAACC CCGAGACCGG AGTCCTGTGG GTCGAGGATC AGCGGCTGGT CGACGTCGCG
TCCCTGTGCA CCAGCCGTCT CGGCCTCGAC GTCAAGCGCG TACGCACTCC GCGGGAGCCG
TGGGAGCTGG ACGAGGAGGT TTCCCACGAG GACCTGTGGG GCACGCGGCG CCGAGGAACG
TCGATCGACC CATGGGCCAA GCCGGCGCTG CTGGCGTACC AGCAGCGCGC CGTCCCGCGC
GGGATCCTGA CCAGGCTGGT CACCTTCAAC GGCGTCGCAG ACCCGGCCGC GTCGCAGGCG
GCAGCAGATG CACGGTTCGC GGTCCACCAC CGGTTCGTCC GGGGCAGGCG ACGCATCTTC
CCCGAACACA CGCTGCTGGG CACCTCGGCG CCGTGCCGGT TGTGCGGTGC CGCTGCGCTT
GAGTTCACGA CCCCGATCAG CACCGGGGAA CTTGCGTACT GCCACACCTG CCTGCGGCAC
GCGACCTCCG GGCTTGGGGA TGACCTGGAC CGGGCGGTGA AGGCGGTGAA GGTCATCATC
GAGACCGAGT TCGACGGCGA CCTGTTCCTC GAGCAGCAGC TGGCGCCGCT CCACATCAGC
CCTGCCGCCC CGGTCGCGGC CGAGTTGATC GACCGGGCGC TACTGCTGAG GTTCGCGATC
GCCCGCGGCG TGTTCCCGTG GACCCACGTC CTGGAGGCGA CCGGCCTGGC GAAGGATGGG
CTGCGCACCA GCCGGGGAAC CCTGATCCGG GCACGCGACG GGCACCTGTG CCATTCGCTG
CGAGAGAAGG CCGTGTGCGA CTTCCTGCAC ATCCGCGGCA TCTCCCACAC CCGGGAGCCG
ATGTACCCCA GCGACGTCGA CTACAACCCA AACGGGCTCC GTCGAGCCGA CTGGGCGCTC
GCAGACGGGA CCCTGGTGGA GCTGTGGGGC ATGCCGGACG ACCCCGCCTA TGCCGCGAAG
ATGGTGGAGA AGCGCGAACT GACGGCGCGC CACGGGCTGC GGCTGGTCGA GCTCCTGGAC
CGCGACCTGC CCAACCTGCC GGACATCTTC GCCGAATGGG CACCCGACGG TGTCGACTCC
GGGTGGGAAT GGTCCCCGCT CCTGCTCGTA TCCCAGCAGG CGCAGGATGC TGCGGCGGAG
AAGGCCGTCC GGGCGAGGGC CGCCAAGAAG GCGGCGCCCG ACGGCACGGT TCGCGGCGTG
AACGCGGCGA GCGTCGCTGC TCGTGACGAG CGGCTCGAAC GGTGCCGCCA CGCGCTCCAA
CTGCAGGCCA AGGGACGGAC CCGGGCCACC ATCGCCGCGG AACTCGGTGT GAGCAGTGAT
GCAGTGAAGG CACTGCTGCG CGACGGCAGG TTCTACGCAG ACCCCTCGAC CGCTCCGGCG
CGCCGCGGAC TGGCGGAGCT GGCAGCGAAA GCCCGGAACT CCGGCACGAC GAGGACGGCG
TTCCAGGAGG CACATGGTCT CTCGCCCGAC AGGGCGAAGG AGTGCTGGCG GGACGCCGGG
GTGCTGGTCC CCACCGACAT CGACCGGCAG TCCTCAGCAG GGTCAAAGGG GGACGCTGGG
TCAGGGTGA
 
Protein sequence
MYKIEQRGRR VVLPATDDPG VVLEFSMWDA EKTLKSFIRV SGGERRRHLA RVKVGLVHRS 
DNTYNPGAVA VVIPADQGGS VEERHLGFLY DADLREIGSS KLPALIRYAG GEVECDAVIA
DRATLGLDLP DLAVLGNAIG KFLRSSGADP GASVMPEPVP ERIGDRFRTR TILGTDETLD
LLRASPEPGG AIAGVDISMA YPFESDSWTL NLHESATGRH LGTVQDTTLF LGNERDREAV
LPHLAAQEVP ARQPKPAPRA TVPGSWPTGQ VPNVRPKWDR GAIALHAAGA PESHSMESFA
IFNPETGVLW VEDQRLVDVA SLCTSRLGLD VKRVRTPREP WELDEEVSHE DLWGTRRRGT
SIDPWAKPAL LAYQQRAVPR GILTRLVTFN GVADPAASQA AADARFAVHH RFVRGRRRIF
PEHTLLGTSA PCRLCGAAAL EFTTPISTGE LAYCHTCLRH ATSGLGDDLD RAVKAVKVII
ETEFDGDLFL EQQLAPLHIS PAAPVAAELI DRALLLRFAI ARGVFPWTHV LEATGLAKDG
LRTSRGTLIR ARDGHLCHSL REKAVCDFLH IRGISHTREP MYPSDVDYNP NGLRRADWAL
ADGTLVELWG MPDDPAYAAK MVEKRELTAR HGLRLVELLD RDLPNLPDIF AEWAPDGVDS
GWEWSPLLLV SQQAQDAAAE KAVRARAAKK AAPDGTVRGV NAASVAARDE RLERCRHALQ
LQAKGRTRAT IAAELGVSSD AVKALLRDGR FYADPSTAPA RRGLAELAAK ARNSGTTRTA
FQEAHGLSPD RAKECWRDAG VLVPTDIDRQ SSAGSKGDAG SG
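
The gene and protein sequences above are displayed in space-separated blocks of 10. The gene starts with GTG, which translation table 11 permits as an initiation codon read as methionine. The Python sketch below illustrates how the two sequences relate. The function names are hypothetical, the codon table is deliberately partial (only the codons needed for the two fragments shown), and the start-codon set lists only the common bacterial starts, so this is not a full table 11 implementation.

```python
# Partial codon table: only the codons used in the fragments below.
CODONS = {
    "GTG": "V", "TAC": "Y", "AAG": "K", "ATT": "I", "GAG": "E",
    "CAG": "Q", "CGC": "R", "GGG": "G", "CGA": "R", "TCA": "S",
    "TGA": "*",
}

# Common bacterial start codons (a subset of those allowed by table 11).
START_CODONS = ("ATG", "GTG", "TTG")


def join_blocks(text):
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide string."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate_fragment(seq, has_start=True):
    """Translate codon by codon, stopping at the first stop codon.

    If has_start is True, a recognised start codon in first position
    is read as M regardless of its usual meaning.
    """
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODONS[codon]
        if i == 0 and has_start and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last nine bases of the gene sequence above.
first_blocks = "GTGTACAAGA TTGAGCAGCG CGGGCGACGC"
last_bases = "TCAGGGTGA"

print(translate_fragment(join_blocks(first_blocks)))      # MYKIEQRGRR
print(translate_fragment(last_bases, has_start=False))    # SG
```

The two outputs match the first ten and last two residues of the protein sequence above. Applying `gc_percent` to the full joined gene sequence is one way to compare against the GC content field of this record.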