Gene Ndas_2624 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2624 
Symbol 
ID9246475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3130122 
End bp3131150 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content75% 
IMG OID 
Product4-hydroxythreonine-4-phosphate dehydrogenase 
Protein accessionYP_003680547 
Protein GI297561573 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.159271 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00648281 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCCGTC CCACACTGGC AGTCACCCTG GGCGACGTGG CGGGGATCGG CCCCGAGATC 
ACGGCCAAGG CGCTGCTGCA CCACCCCGAG GTCCGCGAGT ACGCCAAGCC GGTCGTGGTC
GGTGACGCCG ACGCGCTGCG CAACGCCGTC GCCGCCGTCG GCGGGGACCC GGAGGCGGTC
AACACCGTCG CCTCCCCGGC CGAGGCCGCC GACGAGCCCG GCGTGATCGA CGTCGTGCAG
ACCGGCCCCT CGCTGGGCCA CGTCCCCCCG GGCGAGCTGA GCGCGGAGGC GGGCGACGGG
GCGGCCCGGT TCGTCATCGC CGCCGTGGAC CTGGCCAAGC GCGGCCTGGT GGAGGGCATC
GTCACCCCGC CGCTGAACAA GGCCGCGATG CACCTGGGCG GCCACGCCTG GCCCGGGCAC
ACCGAGCTGC TCGCGCACGA GTTCGGGGTG AAGGACTACA GCCTGGTGCT GTCGGCGGAC
GAGCTGTCCT TCTTCCACCT GACCACGCAC GTGTCGCTGC GCCAGGCCAT CGAGGGCGTC
ACCCAGGAGC GCACCCTCCA GGTGCTGCGC CTGATGAGCG CCTTCGCCCG CGCCCAGGGC
AGCCCGGACG AGCCCATCGG GGTGGCGGGC CTGAACCCGC ACGCGGGCGA GAACCGCCTG
TTCGGCGACG AGGACGCCGA CGTCCTGGCG CCCGCGATCG CCCGCGCCCG CGAGGAGGGC
ATCAACGCCC ACGGCCCGCT CCCGGCCGAC GCCCTGATCC CGGCGGCGGT CAAGGGCAAG
TGGAAGCTGG TCGCGGTCTG CTACCACGAC CAGGGGCACG CGCCCTTCAA GGCGGTCTAC
GGGGACGACG GGGTCAACAT CACCGCGGGC CTGCCGGTGG TGCGCGTCTC GGTCGACCAC
GGCACGGCCT TCGACATCGC GGGCCGGGGC ATCGCCCGCG AGGCCAGCCT CGTGCTGGCG
ATCCGCCGCG CGGCCGAGCT GGCCCCCGGC TGGGGCCACG TCTGGCAGGC CACCCGCTCC
GAGGGGTAG
 
Protein sequence
MSRPTLAVTL GDVAGIGPEI TAKALLHHPE VREYAKPVVV GDADALRNAV AAVGGDPEAV 
NTVASPAEAA DEPGVIDVVQ TGPSLGHVPP GELSAEAGDG AARFVIAAVD LAKRGLVEGI
VTPPLNKAAM HLGGHAWPGH TELLAHEFGV KDYSLVLSAD ELSFFHLTTH VSLRQAIEGV
TQERTLQVLR LMSAFARAQG SPDEPIGVAG LNPHAGENRL FGDEDADVLA PAIARAREEG
INAHGPLPAD ALIPAAVKGK WKLVAVCYHD QGHAPFKAVY GDDGVNITAG LPVVRVSVDH
GTAFDIAGRG IAREASLVLA IRRAAELAPG WGHVWQATRS EG