Gene Noca_2222 details


Gene Information

Locus tag: Noca_2222
Symbol:
ID: 4598720
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 2369027
End bp: 2370706
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 70%
IMG OID: 639776822
Product: type II secretion system protein E
Protein accession: YP_923415
Protein GI: 119716450
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4962] Flp pilus assembly protein, ATPase CpaF
TIGRFAM ID:
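
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the gene ends in a single stop codon; the record itself states neither. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 2369027
END_BP = 2370706
PROTEIN_LENGTH_AA = 559


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, coding_codons(length_bp))  # 1680 559
```

Under those assumptions the computed values agree with the listed gene length (1680 bp) and protein length (559 aa).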


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCACCA ACGGACACCA CCCGGAGACC AACCGCCGCG ACCCGCTGCG CGCAGACGAG 
TGGCTTGAGG CCCGCCATCC CGCGAACCGT GAGCAGACCT CGCCGTTCGC CCGAGGACGC
GGTTCCAACG GCACCCCGCC GCCCCCGCCG GTCGTCGAGG ACGACGACCC GACCTCGCTG
CCGATCTTCG CCGGCGCCTG GACCAGCGAA GGCGAGGGCC AGATGCCTGG CCGCGCCCGC
TCGGAGTTCA GCCTGCGCCC GCTTGTGGCG CCCGCACCGG AGCAGCACCA AGACACCCGC
GGCACGGATG GCGAGGTCGA GCTGGACTGG GAGCTGATCG CGCAATACCG CGCTGAGATC
TCCGCGCGAC TGACCGCCCG GCTCGACAAG GAAGGTGGTC GGGTCACCGA GGAGGACCGC
GAGCAGATGG GCCTCGACGT CATCGAGGAG CTCATCAAAT CCGAGGCCGA GACGTTGGTC
TCGACCGGCC GTCCGCCGTG GACGAAGGAC CACGAGAAGG CACTCAAGTC CGCCCTCCAC
GCCGCCCTGT TCGGGCTCGG CCGCCTGCAG CCCCTGGTCG AGCGCGAGGA CGTCGAGAAC
ATCATCGTCA TCGCCCGCGG CCCGGTCTGC TCGGTGTGGC TGGAGCTGGT CGACGGCACC
CTTGTGGAGG CCGCCCCGAT CGCCGACTCC GAGGACGAGC TGCGCGAGTT CCTCTCCGAC
CTAGGCGCAC GGCAGAACCG GCCCTTCACC GAGGCCCGGC CGCACCTGGA CCTCCGGCTG
CCCGGGGGAG CGCGGCTCGC GGCCGGCTCC TGGGTGATGG CCTACACCTC CGTCGTGATC
CGTCGCCACA GCATGCGCGA GGTGTCGATG GACGAGATGG TCTACGACCG CAAGGCGTGC
AGCGCGGTCC TGGCCGACTT CGTCGCAACC TGCGTGCGGG CGGGCAAGAG CATCGTCGTC
TCCGGGGTCC AAGGCAGCGG CAAGACCACC TGGGTCCGGG CCCTGTGCTC GTGCATCCCG
CCCTGGGAGA TGATCGGCAC CTTCGAAACC GAGTTCGAGC TGCACCTGCA CGAGCTCGTC
GACCGCCACA AGATCGTCCA CGCGTGGGAG CACCGCCCCG GATCCGGCGA GGTCGGCATC
GACGGCCGCC AGGCCGGTGA GTTCAGCCTC GAGGAGGCCA TCCACCACTC CTTCCGGTTC
AGCCTCGCCC GCCAGATCGT CGGTGAGGTC CGCGGCCCGG AGGTCTGGAA CATGCTCAAG
GCCATGGAGT CCGGGCCGGG CTCGATCAGC ACCACCCACG CCCGCAGTGC CGAGCACACG
ATCGAGAAGC TCGTCTCCTG CGCCATGGAG AAAGGCCCCC AGGTCACCCG CGAGCTGGCG
ATCAGCAAGC TGGCCGCCGC GATCGACATC GTGATGTACC TGCGCTCGGA GGTCGTCGCC
AATCCCGACG GCACCTTCCG CAAGCAGCGC TGGGTCGAGG AGGTCCTGGT CGTCCAGCCC
AGCATCGACG CCGCCAGGGG ATACGCCACC ACCCCGATCT TCACCCCTAA CCAGCTCGGC
CAGGCCGTCG CGACCGGCAA GCTCGACAAC TTCCTCGCCC AGGAGCTGGC GCGGCATGGG
TTCGACCTCG AGGCGTACAA GGCCGAGTCC CAGGCCAACC CGGGGGTGGC CACCTCATGA
 
Protein sequence
MSTNGHHPET NRRDPLRADE WLEARHPANR EQTSPFARGR GSNGTPPPPP VVEDDDPTSL 
PIFAGAWTSE GEGQMPGRAR SEFSLRPLVA PAPEQHQDTR GTDGEVELDW ELIAQYRAEI
SARLTARLDK EGGRVTEEDR EQMGLDVIEE LIKSEAETLV STGRPPWTKD HEKALKSALH
AALFGLGRLQ PLVEREDVEN IIVIARGPVC SVWLELVDGT LVEAAPIADS EDELREFLSD
LGARQNRPFT EARPHLDLRL PGGARLAAGS WVMAYTSVVI RRHSMREVSM DEMVYDRKAC
SAVLADFVAT CVRAGKSIVV SGVQGSGKTT WVRALCSCIP PWEMIGTFET EFELHLHELV
DRHKIVHAWE HRPGSGEVGI DGRQAGEFSL EEAIHHSFRF SLARQIVGEV RGPEVWNMLK
AMESGPGSIS TTHARSAEHT IEKLVSCAME KGPQVTRELA ISKLAAAIDI VMYLRSEVVA
NPDGTFRKQR WVEEVLVVQP SIDAARGYAT TPIFTPNQLG QAVATGKLDN FLAQELARHG
FDLEAYKAES QANPGVATS
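
As a rough consistency check, the gene sequence can be translated and compared with the protein sequence. The sketch below is a minimal example, not the database's own method. It builds the standard codon assignments, which translation table 11 shares for internal codons; table 11's alternative start codons are not handled, which does not matter here because the gene starts with ATG. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. Only the first three blocks and the last line of the gene sequence are used, and the last line is in frame because it begins at offset 1620, a multiple of 3.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the blanks and line breaks used for display; upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# Excerpts copied from the gene sequence above.
FIRST_BLOCKS = "ATGAGCACCA ACGGACACCA CCCGGAGACC"
LAST_LINE = "TTCGACCTCG AGGCGTACAA GGCCGAGTCC CAGGCCAACC CGGGGGTGGC CACCTCATGA"

print(translate(FIRST_BLOCKS))  # MSTNGHHPET
print(translate(LAST_LINE))     # FDLEAYKAESQANPGVATS
```

Both excerpts reproduce the corresponding ends of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content of 70%.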