Gene Noca_1541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_1541 
Symbol 
ID4595481 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp1634291 
End bp1635604 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content67% 
IMG OID639776140 
Producttype II secretion system protein E 
Protein accessionYP_922742 
Protein GI119715777 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4962] Flp pilus assembly protein, ATPase CpaF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGACC TCGACTACGC ACTGATCAAG CGGCTGCAAG GCGAACTCGG TCGGCTCCGA 
CAGGAGGAGA TCCTGCGGCG GCGGAGCGCC AACCTCCCAG CGCTCACCGG TCCCGACGCG
GTCCAGCACG GCAAGGCGCT GGTCCAAACG GTTGTCGGAG ACTACGAGTC GAACCTCGTC
GAGACAGGTT CCGAGCCCAT CACGTGGGAA GCGCGGCAGG ATCTCGTCGA AGCGCTGGAG
TCCCGTCTCT TCGGAGCGGG CAGCCTGCAG GCACTACTCG ACGACGCGAA CGTCGAGAAC
ATCGACATCA ACGGCTTCCA ACACGTGTAC GTCGAGTACG CCGACGGCAC TACCGCGAAG
GTGCGACCGA TCGCCGGATC GGACGAGGAG CTCGTCGAGA CCGTCCAGAC GCTCGCGGCA
CACGAGGGGC TCTCGGCTCG GGCCTTCGAC GTCGCCAACG TTCGCGTGAA TCTCCGGCTC
CCGGATGGTT CGCGCCTCTA TGCCGTCCAG TCGGTGACGA AGCAGCCGGT CGTCTCGATC
CGACGCCACC GCCACCCTCG CGTGACTCTC AAGGACCTGA TCGGCCTGGA GACGATCGAC
GAGGAGATGG CCGACTTCTT GGCAGCACTG GTCAGAGCCC GCAAGAACGT CATGGTCGCC
GGGGCCACCA GCGCGGGGAA GACGACCATG CTCCGGGCAC TTGCTTCCGA GATCGGTCCG
GACGAACGCA TCCTCACAGT CGAGCGTTCG CTCGAGCTCG GCCTCGACGA GGATCTGGAA
CACCACCCCA ACGCGATCGC GTTCGAGGAG CGCCTGCCGA ACGTCGAAGG GGCCGGCGCC
GTCACGATGG CCGAGTTGGT CCGCGACACC CTTCGCATGA ATCCCTCCCG CGTGATCGTC
GGGGAGGTCC TCGGCGACGA GGTCGTCACG ATGCTCAACG CGATGACCCA GGGCAACGAC
GGCTCGCTGT CCACGATCCA CGCGAATTCG TCCTCCGACG TCGTCCACAA GATTGCGACG
TACGCCATCC AGGCGCCCGA ACGACTGCCT TGGGAGGCGA CCGTACGGCT GGTCGCGACG
GCGCTGGACT TCGTGGTGTT CATCCGTCGG GTGCGCGGCG AGGACGGGCA GCGACGGGTC
GTCGAGTCGA TCCGCGAGAT CGCCGGGATC AGCGACGACG GCCAGCTCCA GACCAACGAG
CTGTGGGCAC CGGATTCGTT CGGCAACGTC GTACGACGCC ACGGCGTCCA GGTGCGAGCC
CACGACGACC TGGTGGCGGT GGGTTGGCAG CCGGAGCCGG GTGGGTGGTC GTGA
 
Protein sequence
MTDLDYALIK RLQGELGRLR QEEILRRRSA NLPALTGPDA VQHGKALVQT VVGDYESNLV 
ETGSEPITWE ARQDLVEALE SRLFGAGSLQ ALLDDANVEN IDINGFQHVY VEYADGTTAK
VRPIAGSDEE LVETVQTLAA HEGLSARAFD VANVRVNLRL PDGSRLYAVQ SVTKQPVVSI
RRHRHPRVTL KDLIGLETID EEMADFLAAL VRARKNVMVA GATSAGKTTM LRALASEIGP
DERILTVERS LELGLDEDLE HHPNAIAFEE RLPNVEGAGA VTMAELVRDT LRMNPSRVIV
GEVLGDEVVT MLNAMTQGND GSLSTIHANS SSDVVHKIAT YAIQAPERLP WEATVRLVAT
ALDFVVFIRR VRGEDGQRRV VESIREIAGI SDDGQLQTNE LWAPDSFGNV VRRHGVQVRA
HDDLVAVGWQ PEPGGWS