Gene Noca_2821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_2821 
Symbol 
ID4596111 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp2990635 
End bp2991849 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content72% 
IMG OID639777426 
Productvirulence factor Mce family protein 
Protein accessionYP_924010 
Protein GI119717045 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component 
TIGRFAM ID[TIGR00996] virulence factor Mce family protein 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.383501 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATCACCG GGTTGTGGAG CCGGGTCAAC GGCCGCGCGC TCGCGGTCGC GGCCGCTGTC 
GTGCTGCTGG CGGCGACGTA CTTCATCGTG CTGCGCGACG ACACCGCGAC CAAGACCGTG
AGCGCCCACT TCCCGCGGGC GGTCAGCATC TACGAGGGCA GCGACGTGCG GATCCTCGGC
GTGAACGTCG GCCGGGTGAC CGCGGTGACG CCGGAGGGCA ACTCCGTGCG CGTGGACATG
GAGTACGACG CGGAGTACCA GGTGCCGGCC GACGCCCAGG CCGTGATCGT GACTCCGACC
CTGGTCGCGG ACCGGTTCGT CCAGCTCACG CCGGCCTACG CCGAGGGCGA CCGGGTGCTG
GCCGACGGCG CGGACATCGC GCTGCCCGAC ACCGGTGTCC CGGTCGAGCT GGACCGGATC
TACGCGAGCC TGCGCGACCT CTCCGAGGCC CTCGGCCCCA ACGGCGTCAA CAAGGACGGC
ACCCTCGACC ATCTGCTCGA GGCCGGGGCG CACGCGTTGG ACGGCAGGGG CGCGCTCGGC
AACCGGATGC TCACCCGGCT CGCCGCGGCC GCGCGGACGT TCGGCGAGGG AGCGGGTCCG
CTGTTCGACA CCGTCAGCCG GCTCGCCGAG TTCACCACCA CGCTCGCGGA GAACGGCAAG
TTCGTCCGGG CGTTCATCAA GGACCTCGCC GGCGTCTCGT CCCAGCTCGC GGACGAGCGA
ACCGAGATCC AGGGAGCGCT CGCGGCGGTC GCGGACGCGG TCGGGACCGT GAAGTCGTTC
GTGCACGACA ACCGTGCGGC GCTGGTCGCG GACGTCGAGC GACTCACCCG GGTGATGAAG
ACCATCGCCT CCGAGAAGGA CAGCATCGAC ACCGCGCTGC GCGTCGCGCC CGTAGCCATC
GGCAACCTCA GCCTGGCCTA CAACAGCAGG TCCGGGACGA TCGGCTCCCG CATCGGCATC
AGCGGCAACG TGTGGGACGC CGACGGCTTC CTGTGCGCCG TGGTCCAGCA GTCCAGCCTC
TCGCGGGCCA GCAAGGACCT GGCGTGCACG CTGTTCAAGC AGCTTCTCGA GCCGGTCGAG
GGCCAGGTGC CGACCATCCC GCCCGGGCCC GACGGCCGGT CGTCGACGGG CGATCAGGTG
CCGCGCCAGG TGCAGCGTCA GTACGCCGGA GCCGGCGGCG GGTCGCTCGG CCAGCTGATG
GGGGGCGGCT CGTGA
 
Protein sequence
MITGLWSRVN GRALAVAAAV VLLAATYFIV LRDDTATKTV SAHFPRAVSI YEGSDVRILG 
VNVGRVTAVT PEGNSVRVDM EYDAEYQVPA DAQAVIVTPT LVADRFVQLT PAYAEGDRVL
ADGADIALPD TGVPVELDRI YASLRDLSEA LGPNGVNKDG TLDHLLEAGA HALDGRGALG
NRMLTRLAAA ARTFGEGAGP LFDTVSRLAE FTTTLAENGK FVRAFIKDLA GVSSQLADER
TEIQGALAAV ADAVGTVKSF VHDNRAALVA DVERLTRVMK TIASEKDSID TALRVAPVAI
GNLSLAYNSR SGTIGSRIGI SGNVWDADGF LCAVVQQSSL SRASKDLACT LFKQLLEPVE
GQVPTIPPGP DGRSSTGDQV PRQVQRQYAG AGGGSLGQLM GGGS