Gene Ndas_2861 details

Gene Information

Locus tag: Ndas_2861
Symbol:
ID: 9246712
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3415545
End bp: 3416906
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003680778
Protein GI: 297561804
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGCTG ATTCCCCACC GGCCCGGTAC ACGGCGCGGA CGCTGGCCGC GACCTGCGCC 
GTGCTGCTCC TGTACCCCGC GACCGGTGCC CTGGCCCAGG AGTCGGACGC GCAGGACCCC
GGCGCCCAAA GCCCACAGGA CCGCACCCAG GCCATCGCCG ACGCGCTGGA GGAGTCGCCG
GTCTACGTCG ACCCCGCCTA CGAGTCCGCC TTCCCGGCCC AGGACCAGGA GCGGGTCGCC
GGGGTCATCG AGTCCTCCGA CCTGGACCTG TACGTGATCG CGGTGCCCCT GACGTCCGGC
GACGTCTGGA ACGGTGACGC GGCCACGCTC GTGTCGGTCG TGCACGACCG CAGGGGCGGC
GGCGAGGCGC ACTACCTGGC CTACGACGAG GTGTCGGGGT TCCACGGGGA GGACTACGGT
CCCGGTGTCC AGGAGGCGGC CCCCGCGCAC CACGGGGCCC TGGCCGCCTC CTACGGGACG
GACTTCGACG GCTCCATGCT CCAGCAGGCC GACGCGGCCG TGGAGGCCGC GCTGTCCGAC
GACCCGCGCG CGGCCTACGA GGCGGCCCTC CGGTCCTACG AGGAGGCCAA CCCCCACATC
ACCGGCATGG GCGGGGACGC CGGGCAGGGG AGCGGTCCGG GATCCGGCCC GCTGTTCGCG
GTCGGGGCCG TCATCGCGGT CCTGCTCGCG ATCCTGTTGG CGCTGTTCGT GCGCGGCCGC
CTCACCGCGC GCGTGGGCAG GCGCAGGGCG GCCTCCATCA CCCAGCACGC GGCCTTCGAC
AACGCCGACC GCGCCCAGCT GGACTCCCTG GTCGAACAGG GCGAACGCGA CCTCATCGAG
GTGGGCGAGC GCCTGAGCCG CATGGACGCG GACCAGGGGT CCGACACCAG GTCGGGCCAG
AGCCTCCAGC GCGCCCTGGA CGCCCGCTCC GCCGCCGCGG CGGTCCACGA CCGGATGGCC
GCCGAGGGGG CCACCCTGCC CGACGCCGTG GGCGTGCTCG TGCTGCTCGA CATCGCCGAG
GACGCGCTGG AGAGCGCGGC CGGGGGCGCG CGCTCCGCCG TCCGGCGCCG CCACTGCTAC
GCCAACCCGC TGCACGGCAC GAACACCAAG GTGACGCCCT GGCGGGAGTT CGGCGGGACG
CGCACCATCC GCGTCCCGCT GTGCGCCGAC TGCGCCAAGG CCGTCCGCAA CCGGGCGCGT
CCCGTGGTCC TGCCCGCCGA GCACGAGGGC GCGCCGGTGC CCTACTACGA GGTGCCCGCG
GAGGAGAGCG TGTGGGCCGC CACCGGCTAC GGCACGCTCA CCGACGACCT CGTCCAGCGG
GTCCAGCGCG GCGACCACCG GCCCCGGGGG CGGGCCCGCT GA
 
Protein sequence
MPADSPPARY TARTLAATCA VLLLYPATGA LAQESDAQDP GAQSPQDRTQ AIADALEESP 
VYVDPAYESA FPAQDQERVA GVIESSDLDL YVIAVPLTSG DVWNGDAATL VSVVHDRRGG
GEAHYLAYDE VSGFHGEDYG PGVQEAAPAH HGALAASYGT DFDGSMLQQA DAAVEAALSD
DPRAAYEAAL RSYEEANPHI TGMGGDAGQG SGPGSGPLFA VGAVIAVLLA ILLALFVRGR
LTARVGRRRA ASITQHAAFD NADRAQLDSL VEQGERDLIE VGERLSRMDA DQGSDTRSGQ
SLQRALDARS AAAAVHDRMA AEGATLPDAV GVLVLLDIAE DALESAAGGA RSAVRRRHCY
ANPLHGTNTK VTPWREFGGT RTIRVPLCAD CAKAVRNRAR PVVLPAEHEG APVPYYEVPA
EESVWAATGY GTLTDDLVQR VQRGDHRPRG RAR
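
The sequences above are printed in space-separated blocks of ten characters, so whitespace has to be removed before measuring length or GC content. The following is a minimal sketch of that step. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def sequence_length(raw: str) -> int:
    """Number of residues or bases once whitespace is stripped."""
    return len(clean_sequence(raw))


def gc_fraction(raw: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence, used only as a small example.
first_blocks = "ATGCCCGCTG ATTCCCCACC"
print(sequence_length(first_blocks))       # 20
print(f"{gc_fraction(first_blocks):.0%}")  # 65%
```

Applying the same functions to the full gene sequence would allow the listed Gene Length and GC content to be checked against the sequence itself.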