Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_4729 |
Symbol | |
ID | 4595479 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008697 |
Strand | - |
Start bp | 28529 |
End bp | 30028 |
Gene Length | 1500 bp |
Protein Length | 499 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639772518 |
Product | type II secretion system protein E |
Protein accession | YP_919178 |
Protein GI | 119714036 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4962] Flp pilus assembly protein, ATPase CpaF |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 0.103918 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAGCA ACAACGGAGC CAGCTCGAAT GGCCGTCGCG CGCACCTGAA CGGGATGCAC CTGAAGCAGC TCGTGCCGGT CGACCAGGTA CTGGTCAAGC GTCTCCAGTC CCGCGTGGGT GCTCGCCGCG GGAAGGCGCT GGAGGAGTTC CGCCGCAACA GCCAGCCGAT CCCCAAGGGC GAGGACGCCC GGCAGCACAC CGAAGCCCTG CTGGCCTCGG TGCTGGCCGA GTACGAGTCC GACCTGGTCG AGGACGGCCA CAACCCGCTG GAGGAAGACG CCCGGGACCG GCTGGAGGCC GCGCTCAAGG CCCAGTTGTT CGGCGCAGGC AACCTCGATC AGCTGCTGGA GGATCCCGAG GCCGAGGACA TCGTCATCAA CAGTTGGCAG AACGTGTTCG TCACCTACGC CGACGGCACC AAGGCGCAGA TGCCGCCGGT CGCCTCCTCG GACAAGGAGC TGGAGGAGAT CGTCAAGACG ATTTCTGCCC ACGACGGCCT GTCCACCCGT GCCTTCGATC TGATCAACTA CAACGTCACC CTCAGGCTGC ACGACGGCTC ACGTCTGCAC GCGGTCCAGG GGGTGAGCGC GAACGGTCTG TCGATCTCGA TCCGCAAGCA CCGCCACAAG AGGGCCACCC TGCTTCCGGT TCCCGGGCTG GCCGAGCGGG AGCGCGCCGC CGGGGTTCCG GAGTCGCTGC GCACCAAGGA CCTGCTGAGC GAGGGAACCG TCGACCACGA CCTCGCCGCG TTCCTGTCCG CGCTGGTGAA GTCGCGTCAG AACATCATGA TCGCCGGAGC CGTAAGTGCG GGGAAGACCA CGCTTGTGCG AGCGCTGGCC TCCGAAATAG ACCCGATCGA GCGTCTGGTC ACCGTGGAGC GGTCCATCGA GCTCGGCCTG CACGAGGACC CCGAGCGCCA CCCCGACATC GTGCCGCTCG AGGAGCGGCT GGCGAACGTC GAAGGAGAAG GCGCAGTCGG GTTGGCCGAC CTGGTGCGCA ACTCGCTGCG GATGAACCCC TCGCGGGTGA TCGTGGGCGA GGTCTTGGGT GATGAGGTCA TCACGATGCT CAACGCCATG GCCCAGGGCA ACGACGGGTC GCTGAGCACG ATCCACGCGA ACTCCTCGCG CGATGTCATC GGGAAGATCC AGACCTACGC TCTGCAGGCT CAGGAGCGGT TGCCGTTCGA GGCCACCAAC GGCCTGATCG CGAACGCACT GAACTTCATC GTGTTCCTTC GCCGGATCCG CACCGAGGAC GGCCGTCAGC GCCGGATCGT CGAGTCGATC CGTGAGGTTG CCGGACGTGA TGAGGACGGC GTGAAGACGA CCGAGCTGTG GAAGTACAAC AGGGCCACCG GTCGCACCGA GTTCACCCGG AAGGCGATCA TCCGCGAGGA AGCCCTCCTC GATGTCGGCT GGGACCCCGA CGGCACGACG GACCTGAACA GGTTCGCCGA GCCGAGCGGT GCCAACGGCG ATGAGGGGTG GCAGATCTGA
|
Protein sequence | MNSNNGASSN GRRAHLNGMH LKQLVPVDQV LVKRLQSRVG ARRGKALEEF RRNSQPIPKG EDARQHTEAL LASVLAEYES DLVEDGHNPL EEDARDRLEA ALKAQLFGAG NLDQLLEDPE AEDIVINSWQ NVFVTYADGT KAQMPPVASS DKELEEIVKT ISAHDGLSTR AFDLINYNVT LRLHDGSRLH AVQGVSANGL SISIRKHRHK RATLLPVPGL AERERAAGVP ESLRTKDLLS EGTVDHDLAA FLSALVKSRQ NIMIAGAVSA GKTTLVRALA SEIDPIERLV TVERSIELGL HEDPERHPDI VPLEERLANV EGEGAVGLAD LVRNSLRMNP SRVIVGEVLG DEVITMLNAM AQGNDGSLST IHANSSRDVI GKIQTYALQA QERLPFEATN GLIANALNFI VFLRRIRTED GRQRRIVESI REVAGRDEDG VKTTELWKYN RATGRTEFTR KAIIREEALL DVGWDPDGTT DLNRFAEPSG ANGDEGWQI
|
| |