Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3884 |
Symbol | |
ID | 9247755 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4655270 |
End bp | 4656427 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | phosphoribosylaminoimidazole carboxylase, ATPase subunit |
Protein accession | YP_003681787 |
Protein GI | 297562813 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.964763 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGAGC GTAACCGCAC CGTCCCGCGC GTGGGAATGG TGGGAGGGGG CCAACTCTCC CGGATGACCC ACCAGGCGGG CATCGCCCTG GGGGTCGACT TCTCGGTCCT GGCGTCCAGC CCCGCCGACA GCGCCGCGCT GGTGTGCGGT GACGTCTCCC TGGGCGACGA CCGCGGCCTC GACGACGTCC TGGCCTTCGC CAAGGCCCAC GACGTGGTCA CCTTCGACCA CGAGCACGTG CCCGAGCCCG TCCTGCGCGC GGTCGAGGAG GCCGGGGGCC TGCTGCGCCC CGGCCGCGAC GCGCTGCGCT TCGCCCAGGA CAAACTGCGC ATGCGCACCC GTATGGCCGA GCTGGGCGCG CCCTCGCCGC GCTGGCGTGC CGTCACCACC CTGGAGCACG TCACGGCCTT CGCCGGGGAG ACCGGGTGGC CGGTCGTGCT CAAGGCGGCC CGCGGCGGCT ACGACGGCAA GGGCGTGTGG GTCGTCGGTG ACGCCGACGA GGCCCGCGGG GTCGTGGACC GCGCCGCCGC CGAGGAGGTG CCGCTCCTGG TCGAGGGGAA GGTGGACTTC TCGCGCGAGC TGGCCGTGCA GGTCGCCCGC TCCCCGCACG GGCAGGTCGC GGTCTACCCG GTCGTGGAGA CCGTGCAGCG CGGCGGCATC TGCCACGAGG TGATCGCCCC CGCCCCGGAC CTGTCCGAGG ACAAGGCCAC CCACGCCCAG CAGCTGGCCA TCGAGATCGC CCAGGCGCTG GACGTGACCG GGGTCCTGGC CGTGGAGCTG TTCGAGACCG CCGACGGCGT GGTCGTCAAC GAGCTGGCCA TGCGCCCGCA CAACTCCGGC CACTGGAGCA TCGAGGGCGC GCGCACCTCC CAGTTCGAGC AGCACCTGAG GGCCGTGCTG AACCTGCCGC TGGGCTCGCC GCGCACCAAC GCGCCCTACA CCGTCATGGC CAACCTGCTG GGCGGCGAGG ACCCCGAGGT CTACCGCCGC TACCTGCACG TGATGGCGAA GGACCCCGAG GTGAAGGTGC ACTTCTACGG CAAGGACGTG CGTCCGGGCC GCAAGATCGG GCACGTCACC GTGATGGGTG AGGACTACCG TGACCTGCTG GCGCGCGCGC GAGACGCCGC CGCCTACCTG CGAGGAGACG AACAGTGA
|
Protein sequence | MSERNRTVPR VGMVGGGQLS RMTHQAGIAL GVDFSVLASS PADSAALVCG DVSLGDDRGL DDVLAFAKAH DVVTFDHEHV PEPVLRAVEE AGGLLRPGRD ALRFAQDKLR MRTRMAELGA PSPRWRAVTT LEHVTAFAGE TGWPVVLKAA RGGYDGKGVW VVGDADEARG VVDRAAAEEV PLLVEGKVDF SRELAVQVAR SPHGQVAVYP VVETVQRGGI CHEVIAPAPD LSEDKATHAQ QLAIEIAQAL DVTGVLAVEL FETADGVVVN ELAMRPHNSG HWSIEGARTS QFEQHLRAVL NLPLGSPRTN APYTVMANLL GGEDPEVYRR YLHVMAKDPE VKVHFYGKDV RPGRKIGHVT VMGEDYRDLL ARARDAAAYL RGDEQ
|
| |