Gene Noca_4966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4966 
Symbol 
ID4595337 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008697 
Strand
Start bp297970 
End bp300246 
Gene Length2277 bp 
Protein Length758 aa 
Translation table11 
GC content68% 
IMG OID639772748 
ProductMMPL domain-containing protein 
Protein accessionYP_919408 
Protein GI119714266 
COG category[R] General function prediction only 
COG ID[COG2409] Predicted drug exporters of the RND superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.586388 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00568973 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTCGCAA CACGTTCGAA GGACCTTGAA GGCCACCGCG TGCACGCGAG TGGTACGTCG 
ATCACCCAAC GGCTGGCCCG AGCAGCGGCG GTACACCCGT GGCGGGTTGT GGTCGCGTGG
GGGCTGATCC TGCTCGCTTC CATGGCGGCC ATCGCCACCC TGATCGGGTC GGCGTTCACC
TCCGACGGCA ACATCACCAG CAACCCCGAG TCGGCGAGGG CCGGACAGGT CATGGCGGAG
AACTTCTCCC AGGACGACCG CATCGACGAC GCCGTGATCA TCTACTCGGC CGACCTCACC
GCCGAGAATC CGGAGTTCCG GGCCTTCGTC GCCGATTTCC GGTCCTCAAT CGAAACCACG
GGTGCTGCAG AGACGGTCCG GGACCCTTAC GCCGCCGACG GGGCTGGCGT CTCCCAGGAC
GGACACGCCG CCGTCGTCAC GCTGGTCCTC GGCCACGATT CCGCGACCGG GATCGTGGAC
GTGATTGACG AGGTCGTCGC CGCGGACGCC GACGCGGCCT TCGAGGTCGA CATCACCGGC
ACCAACACCC TCGACCACGA CTTCACCGAG CTCTCGGAGT CCGACCTCAC CAACGGTGAG
GTGAAGTTCG GGCTGCCAGC AGCAATGATC GTGCTGGTCC TGGTCTTCGG CGCTCTGGTG
GCCGCCTTCC TACCGATGAG CATCGCCATC CTGTCGATCA TCGTGACCGT CGCCATCTCC
TCGATCCTCG GCCAGTTCAC CTCCCTGTCC TTCTTCATCG TCAACATGAT CACCGCGATG
GGGCTCGCCC TCGGCATCGA CTACAGCCTC TTCGTGCTCT CGCGCTACCG AGAGGAGCGT
CAGGCCGGCC GCGACAAGAT CGACGCGATC ATCGCCACCG GAGGCACCAG CAGCAAAGCC
GTCCTGTTCT CCGGCAGCTC GTTCGTCGTC GCCCTGCTCG GACTCCTCCT GGTCCCGGAC
ACCATCCTGC GCAGCCTGGC CCTCGGCGCG ATCATCGTCG GCCTGGTCAC CATGGCCGCC
GCCCTCACCC TGCTGCCCGC CCTGCTGTCC CTGCTCGGCG ACCGGGTCAA CGCCCTGCGG
CTGCCGATCG TCGGTCGCGA CCACCCCGCC GAGAGCCCCT TCTGGACACG CGCCGTCAGT
GCTGTAGTCC GCCGACCAGG CACCTTCCTG GCAGCCGGCC TGCTGGTCCT GCTCGCCGCA
GCCCTCCCCG TGTTGGGCCT CCACACCGGC ACCTCAGGCG TCAACTCGCT GCCCGACGAC
ACCTACGCCA AGGCCGGCGC CCTCGCCCTC GAACGGAGCT TCCCGGGCAC CAGCACGACC
GACCCCGCCC AGGTCGTCAT CTCGGGCGAC GTCACCTCCA GCGACGTCAC CGCAGCGATC
GCGGAGCTCG AGGCGTCCGT CGCCGACGAC CCGGACTTCG GCCCCTCGCA GCAGCGGCTC
GCCGAGAACG GTCAAGTGGC CGTGGTCAGC ATCCCGCTGG TCGGCGACCC AGACCATTCC
CAGGCCCGCC GCGGGCTCGA CCGGCTCCGC ACCAACCTGA TCCCCGCCGC CTTCGACGGG
ACCGACACCA CCGTGCTGGT CGGCGGCACC ACCGCCGAGA ACGTCGACTA CTCCGACGTC
ATCAACCACT GGCTGCCGTT AGTGCTCGCC TTCGTCCTCG GGCTGAGCTT CCTCCTGCTC
ACCCTGGTCT TCCGCTCCCT GGTCCTCTCC GCCACCGCGG TCGTCCTCAA CCTGCTCTCG
GTCGGCGCGG CCTACGGCCT GCTCGTGCTG GTCTTCCAGC ACGGGGTCGG GGCCGACCTA
CTGGGCCTCC AGCAGGTCGA ACGGGTCGAG GCGTGGGTCC CGGTGTTCCT GTTCTCGGTG
CTGTTCGCAC TCTCGATGGA CTACCACGTC TTCCTGCTCA GCAGGATCCG GGAACGCTAC
GCACAGACCG GCGACACCAA CGAAGCCGTC GTGCACGGCA TTGCCGCCAC CGGCCGCCTC
ATCACCGGAG CCGCGCTGAT CATCGTGGTC GTCTTCACCG GCTTCGCCGC CGGCCAGCTC
GTCATGTTCC AACAGATGGG CTTCGGTGTC GGCGTCGCCC TGCTCGTCGA CGCCACCCTC
ATCCGAATGG TCGTCGTCCC CGCCGCGATG GCCCTCCTCG GCAAGTGGAA CTGGTACCTG
CCCAGCTGGC TCAACTGGCT CCCCGAGCTC CACGTCGAGG GCGACCCCGA CTCCCTGGCA
CCCACCGCCG ATGCGCCCAT TGCTGTGGCA GCCCCGCGGA CCCCGATCTC GGTGTAG
 
Protein sequence
MFATRSKDLE GHRVHASGTS ITQRLARAAA VHPWRVVVAW GLILLASMAA IATLIGSAFT 
SDGNITSNPE SARAGQVMAE NFSQDDRIDD AVIIYSADLT AENPEFRAFV ADFRSSIETT
GAAETVRDPY AADGAGVSQD GHAAVVTLVL GHDSATGIVD VIDEVVAADA DAAFEVDITG
TNTLDHDFTE LSESDLTNGE VKFGLPAAMI VLVLVFGALV AAFLPMSIAI LSIIVTVAIS
SILGQFTSLS FFIVNMITAM GLALGIDYSL FVLSRYREER QAGRDKIDAI IATGGTSSKA
VLFSGSSFVV ALLGLLLVPD TILRSLALGA IIVGLVTMAA ALTLLPALLS LLGDRVNALR
LPIVGRDHPA ESPFWTRAVS AVVRRPGTFL AAGLLVLLAA ALPVLGLHTG TSGVNSLPDD
TYAKAGALAL ERSFPGTSTT DPAQVVISGD VTSSDVTAAI AELEASVADD PDFGPSQQRL
AENGQVAVVS IPLVGDPDHS QARRGLDRLR TNLIPAAFDG TDTTVLVGGT TAENVDYSDV
INHWLPLVLA FVLGLSFLLL TLVFRSLVLS ATAVVLNLLS VGAAYGLLVL VFQHGVGADL
LGLQQVERVE AWVPVFLFSV LFALSMDYHV FLLSRIRERY AQTGDTNEAV VHGIAATGRL
ITGAALIIVV VFTGFAAGQL VMFQQMGFGV GVALLVDATL IRMVVVPAAM ALLGKWNWYL
PSWLNWLPEL HVEGDPDSLA PTADAPIAVA APRTPISV