Gene Ndas_2620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2620 
Symbol 
ID9246471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3124004 
End bp3125350 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content75% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003680543 
Protein GI297561569 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0236435 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00226971 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCCACCGC TGCCCAGGAT GTCCGCCTCC GCCGCCTCTC CCGAGGTGGC GGCCCTGGTG 
GACCGCCTGC TCCGTGATTC GCGGCGCCCC GGGAGACCGG ACCTGCTGGC GTCCCCGGCC
GCCCCGCGGA CGCCTCCCGC GCCCCGGCTG CCCGTGGGGT CCCGCCAGGT GGGCACGTCG
GCGCTGCGGC TCCGGCTGCC CGACTCCCAC GTCGCCGACC AGAGCCCCGG CGCCCTCGCC
GTGCGCACCG GACTCCGCCT CCCCGGCCAC CCCGGCGGCC TGGGCGTCCC GGGCCCCACC
GCCCCGGCGC CCGGGCGCCC GGGGACCGGT GTCCCGCCCG AGGGGCCGGT CGTCGCCTCC
GAACCCGAGG GGCCCGAGGT CGACGGCGAG CGGATCCCGG CGCTGTACTG CCCGCCCGCC
GTCCGCGACG ATCCCGCCCT GGGGGACGAG GTCGACGAAC GGCTGCTGGT GTGGGCCGAG
GAGATGGGTG TCTACCCCGG CCAGCTCGAC CGGGTCCGCT CGGCCGGGTT CGGACGCCTC
ATCATGCTCG CCCACCCCGA GACCGACGAT CCCGACCGGC TGCTGGCGGC GGCCAAGTGC
GCCCTGGCCG AGTGGTCCGT GGACGACCAC TACGTCGACG GCGAGGCCGA GGAGGCACAG
CCGGAGCTGC TCGGCCAGCG CCTGGCCATC GCCCACTCGG TCATCGACCA GGCGCACCTG
CCGCTGCGCT ACGCGCCCCA GCTGGAGGAG GTCGTGCGCG CGGACCCCGT GATGCGGGCC
CTGCGCTCCA GTCTGGACGG CCTGGGCCTG TACGCCACCG CCTCGCAGGT GCGCAGGCTC
CGCCACGAGC TGGCGATCAT GTTCGTGGCC TACAACCAGG AGGGCGTCTG GCTGGCCACC
GGCCACCGGC CGCCGGTCTG GGAGTTCCTC ATGCACCGGC ACGAGAACAG CTTCGTGCCG
TGCATGGCGC TCATCGACGC GGTCGCCGGG TACGAGCTGC CGCACCAGGT GTTCTCCGAG
CCCAGCGTGC GCCGCGTGTT CACCCTGGCG GGCAGCGCGA GCGTGATCGT CAACGACCTG
TACTCCATGG GCAAGGAGGA CGTCACCGAC CTCAGCCTTC CCCGGCTGAT CGCCACCGAG
GACGGGTGCT CGCTGGGTGA GGCGGTGCGG CGCACCGTCG ACATCCACGA CGAGCTCATG
CACGCCTTCG AGGCCGAGGC CGCCGCCCTG GCGCTCACCG GCTCGCCGGA GCTGCGCCGC
TTCCTGTGGG GCCTGTGGGC CTGGCTCGGC GGCAGCCGCG AATGGCACGC CCGCAGCCCC
CGCTACCACG GGGCGGGCAC CGACTGA
 
Protein sequence
MPPLPRMSAS AASPEVAALV DRLLRDSRRP GRPDLLASPA APRTPPAPRL PVGSRQVGTS 
ALRLRLPDSH VADQSPGALA VRTGLRLPGH PGGLGVPGPT APAPGRPGTG VPPEGPVVAS
EPEGPEVDGE RIPALYCPPA VRDDPALGDE VDERLLVWAE EMGVYPGQLD RVRSAGFGRL
IMLAHPETDD PDRLLAAAKC ALAEWSVDDH YVDGEAEEAQ PELLGQRLAI AHSVIDQAHL
PLRYAPQLEE VVRADPVMRA LRSSLDGLGL YATASQVRRL RHELAIMFVA YNQEGVWLAT
GHRPPVWEFL MHRHENSFVP CMALIDAVAG YELPHQVFSE PSVRRVFTLA GSASVIVNDL
YSMGKEDVTD LSLPRLIATE DGCSLGEAVR RTVDIHDELM HAFEAEAAAL ALTGSPELRR
FLWGLWAWLG GSREWHARSP RYHGAGTD