Gene Noca_4729 details


Gene Information

Locus tag: Noca_4729
Symbol:
ID: 4595479
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008697
Strand:
Start bp: 28529
End bp: 30028
Gene Length: 1500 bp
Protein Length: 499 aa
Translation table: 11
GC content: 66%
IMG OID: 639772518
Product: type II secretion system protein E
Protein accession: YP_919178
Protein GI: 119714036
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4962] Flp pilus assembly protein, ATPase CpaF
TIGRFAM ID:
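
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is complete and ends in a stop codon; the function names `span_length` and `expected_protein_length` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information record.
start_bp = 28529
end_bp = 30028
gene_length_bp = 1500
protein_length_aa = 499


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record: 30028 - 28529 + 1 = 1500 and 1500 / 3 - 1 = 499.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```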


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value: 0.103918
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAGCA ACAACGGAGC CAGCTCGAAT GGCCGTCGCG CGCACCTGAA CGGGATGCAC 
CTGAAGCAGC TCGTGCCGGT CGACCAGGTA CTGGTCAAGC GTCTCCAGTC CCGCGTGGGT
GCTCGCCGCG GGAAGGCGCT GGAGGAGTTC CGCCGCAACA GCCAGCCGAT CCCCAAGGGC
GAGGACGCCC GGCAGCACAC CGAAGCCCTG CTGGCCTCGG TGCTGGCCGA GTACGAGTCC
GACCTGGTCG AGGACGGCCA CAACCCGCTG GAGGAAGACG CCCGGGACCG GCTGGAGGCC
GCGCTCAAGG CCCAGTTGTT CGGCGCAGGC AACCTCGATC AGCTGCTGGA GGATCCCGAG
GCCGAGGACA TCGTCATCAA CAGTTGGCAG AACGTGTTCG TCACCTACGC CGACGGCACC
AAGGCGCAGA TGCCGCCGGT CGCCTCCTCG GACAAGGAGC TGGAGGAGAT CGTCAAGACG
ATTTCTGCCC ACGACGGCCT GTCCACCCGT GCCTTCGATC TGATCAACTA CAACGTCACC
CTCAGGCTGC ACGACGGCTC ACGTCTGCAC GCGGTCCAGG GGGTGAGCGC GAACGGTCTG
TCGATCTCGA TCCGCAAGCA CCGCCACAAG AGGGCCACCC TGCTTCCGGT TCCCGGGCTG
GCCGAGCGGG AGCGCGCCGC CGGGGTTCCG GAGTCGCTGC GCACCAAGGA CCTGCTGAGC
GAGGGAACCG TCGACCACGA CCTCGCCGCG TTCCTGTCCG CGCTGGTGAA GTCGCGTCAG
AACATCATGA TCGCCGGAGC CGTAAGTGCG GGGAAGACCA CGCTTGTGCG AGCGCTGGCC
TCCGAAATAG ACCCGATCGA GCGTCTGGTC ACCGTGGAGC GGTCCATCGA GCTCGGCCTG
CACGAGGACC CCGAGCGCCA CCCCGACATC GTGCCGCTCG AGGAGCGGCT GGCGAACGTC
GAAGGAGAAG GCGCAGTCGG GTTGGCCGAC CTGGTGCGCA ACTCGCTGCG GATGAACCCC
TCGCGGGTGA TCGTGGGCGA GGTCTTGGGT GATGAGGTCA TCACGATGCT CAACGCCATG
GCCCAGGGCA ACGACGGGTC GCTGAGCACG ATCCACGCGA ACTCCTCGCG CGATGTCATC
GGGAAGATCC AGACCTACGC TCTGCAGGCT CAGGAGCGGT TGCCGTTCGA GGCCACCAAC
GGCCTGATCG CGAACGCACT GAACTTCATC GTGTTCCTTC GCCGGATCCG CACCGAGGAC
GGCCGTCAGC GCCGGATCGT CGAGTCGATC CGTGAGGTTG CCGGACGTGA TGAGGACGGC
GTGAAGACGA CCGAGCTGTG GAAGTACAAC AGGGCCACCG GTCGCACCGA GTTCACCCGG
AAGGCGATCA TCCGCGAGGA AGCCCTCCTC GATGTCGGCT GGGACCCCGA CGGCACGACG
GACCTGAACA GGTTCGCCGA GCCGAGCGGT GCCAACGGCG ATGAGGGGTG GCAGATCTGA
 
Protein sequence
MNSNNGASSN GRRAHLNGMH LKQLVPVDQV LVKRLQSRVG ARRGKALEEF RRNSQPIPKG 
EDARQHTEAL LASVLAEYES DLVEDGHNPL EEDARDRLEA ALKAQLFGAG NLDQLLEDPE
AEDIVINSWQ NVFVTYADGT KAQMPPVASS DKELEEIVKT ISAHDGLSTR AFDLINYNVT
LRLHDGSRLH AVQGVSANGL SISIRKHRHK RATLLPVPGL AERERAAGVP ESLRTKDLLS
EGTVDHDLAA FLSALVKSRQ NIMIAGAVSA GKTTLVRALA SEIDPIERLV TVERSIELGL
HEDPERHPDI VPLEERLANV EGEGAVGLAD LVRNSLRMNP SRVIVGEVLG DEVITMLNAM
AQGNDGSLST IHANSSRDVI GKIQTYALQA QERLPFEATN GLIANALNFI VFLRRIRTED
GRQRRIVESI REVAGRDEDG VKTTELWKYN RATGRTEFTR KAIIREEALL DVGWDPDGTT
DLNRFAEPSG ANGDEGWQI
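
The relationship between the gene sequence and the protein sequence above can be reproduced with a short script. This is a minimal sketch, not the pipeline that generated the record: the helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the codon table is the standard one, whose amino-acid assignments translation table 11 shares (the two tables differ in their permitted start codons, which this sketch does not model).

```python
BASES = "TCAG"
# Amino acids in NCBI order: codons enumerated with T, C, A, G at each position.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # "X" marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence as displayed in the record.
first_row = "ATGAACAGCA ACAACGGAGC CAGCTCGAAT GGCCGTCGCG CGCACCTGAA CGGGATGCAC"
print(translate(first_row))  # MNSNNGASSNGRRAHLNGMH
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way allows a comparison against the listed protein sequence and GC content.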