Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_3829 |
Symbol | |
ID | 4595894 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 4051046 |
End bp | 4051987 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 639778437 |
Product | ApbE family lipoprotein |
Protein accession | YP_925016 |
Protein GI | 119718051 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0256757 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGCCG CCACCGCCCG GTCCAGCGCA CTGGGCACCT ACGTCTTCCT GGCCACCCGC CGGGCGGCCG ACCTCGACAC CGCGAGCCGG ATCGCCCGCC ACGTGCTCGA CGACGTCGAC CGCACCTGCA GCAGGTTCCG GCCCGACTCC GACCTCTCCC GCGCCAACGC GGGCGCGGGC GGCTGGGTCG CGGTCGACCC GGTCCTCGTC GCCGCCGTCA CCGCCGCCTG CGCCGCGGCC GAGGACACCG ACGGGCTGGT CCACCCGCTG CTCGGACGGA ACCTGGTGGA GCTGGGCTAC GACCGCGACT TCGCCGCGCT CGCCGCCGTC GAGGACGACC GGGTGCCGGC GCAGCTGTGG CCCACCACGG CGCCGGGCCG GGACCGCTGG CGCGAGATCG GACTCGACGC CGACGGTGCG ATCAGGGTGC CCGCCGGCAC CGCCCTGGAC CTCGGCGCGA CGGGCAAGGC CTGGGCCGCG GACGTGATCG CGACCGCGTT CGCCGAGGAG CTCGGCGGAC CCGCCCTGGT CAGCCTCGGC GGCGACCTGG CCATCGCGGC GCCCGACGGG CAGCCCTGGC CGGTCGCGAT CTCCACCCAT CCGGACGGAC CGGTCGAGAC GACCATCGCG CTGGACCGGG GCGGCCTCGC GACCTCGAGC ACCCGGGTGC GCCGTTGGTC CCGCCGCGGG ACCGACCTCC ACCACCTGCT CGACCCACGC ACCGGCCGGC CCGCGCCGGA GGTGTGGCGC ACCGTGACCG CGACCGGGCC CACCTGCACG GCGGCCAACA CCGCCTCGAC GGCCGCGGTC GTGCTGGGGC GGGACGCACC GGCCTGGCTG ACCGGCCGCG GAGTCGCCGC GCGCCTCGTC GACCGCACCG GGCGGGTGCG CACGACCGGC GCCTGGCCCA CCGACACCGA CGAGCACAGG AGACCCGCAT GA
|
Protein sequence | MTAATARSSA LGTYVFLATR RAADLDTASR IARHVLDDVD RTCSRFRPDS DLSRANAGAG GWVAVDPVLV AAVTAACAAA EDTDGLVHPL LGRNLVELGY DRDFAALAAV EDDRVPAQLW PTTAPGRDRW REIGLDADGA IRVPAGTALD LGATGKAWAA DVIATAFAEE LGGPALVSLG GDLAIAAPDG QPWPVAISTH PDGPVETTIA LDRGGLATSS TRVRRWSRRG TDLHHLLDPR TGRPAPEVWR TVTATGPTCT AANTASTAAV VLGRDAPAWL TGRGVAARLV DRTGRVRTTG AWPTDTDEHR RPA
|
| |