Gene Ndas_4745 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4745 
Symbol 
ID9248627 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5629263 
End bp5631017 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content71% 
IMG OID 
Productcell envelope-related transcriptional attenuator 
Protein accessionYP_003682637 
Protein GI297563663 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGCTG ATGAGAAGCC CTCTGCCTCC GGCAGGGACC ACACACCAGA CGACAGGCCC 
GAGGACGGCT CCACACCCCG ATCAGGGACC GAGAACACCG GCGACGACGC GGCGGGTGGG
GACAACACCA CCGCGTCCGA GACCGGGGCT GACGGAACCG GCCGGGCCCA CGAGTCGGAC
GAGACCGAGG GGGCGGAGGA GGTCGGAAAG GCAGAAGAGG CCCCCAACAC CGGTGTGACA
TCGGACACCG GAGGCGCCCA GGAGACGCAG CAGGCCCAGG AGACCCCGGA GGCTGACGAG
CCTGCCGGTG CCGGGGAGGC TGCCGAGGCT GGCGACACCC GAAAGTCCAC CGGTACCGAA
GGGGCCACCG AGACCGAGAA GGTCACCGGA GACGAGAAGG CCGTTGAGGC CGAGGAGGCC
ACTGCGGAAT CAGCGGCTGG ATCCGCCGCC GGGGACGAAG AGCGCACTGG GACCAAGGCG
CCCTCAGAGG CCGGTGGGGC TGGTGCCCCA GCCGGAGCCG CCGGGACCGG GGAGTCCGGA
GAGACCGCGG AGACCGCTGG GGCCGCCCCG TCCGCCGCTC CCGTCCGCAC GAAGCGCCGC
CGGACCGGCA GGATCCTGGT CTGGGTCGCC GCGAGCCTGG TCCTCGTCCT GGCCGCCGGG
GTCGGCACCG CCTACGGCTA CTACCGCTCG CTGCGCTCGG ACATGGTCCA GTACGACATC
GACGGGCTGC TCAAGGAGGA GGACCGGCCC GAGAGGATCA ACGACTCCGT CAACATCCTC
TTCATGGGCA CCGACGGCTA CGAGGAGGGC AGCACCGCCT ACTCGACGGA GTTCGAGGGC
GAGCGTTCGG ACTCGATCAT GCTGGCGCAC ATCTCACCCG AGAGCCGGGT GTCGGTGATC
AGCTTCCCCC GCGACTCGCT GGTGGCCCTG CCCGACTGCG ACCCCTACGG AGAAACCGAG
GGCACGCCCG GCTACTTCGG CATGATCAAC GCCGCGATGT ACCACGGCGG ACCGCCCTGC
GTGGTCAGCA CCATCGAGTC GCTGAGCGAC GTCCGCATCG ACCACTTCGT GCACCTCAGC
TTCATGAGCT TCCGGGACGT GGTGGACGCC ATCGGCGGCG TGGACATGTG CATTCCCGAG
CCGATGGAGG ACAGCCGGTC CAAGCTCGAC CTCGACGCGG GCCAGCAGAC CCTCGACGGC
GACGAGGCGC TGTCGTTCGT CCGGGCCCGC TACGAGATCG GCGACGGCGG CGACATCGGC
CGCATCGACC GCCAGCAGAT GTTCCTCGCG GCCCTGGCCG ACCAGGTGAC CAGAAACGAC
GTGATCACCG ACCCGGGCAG GCTCAACGCC GTTCTGCGCG CGGTGGCCGA GCACAGCGCC
ACGGACAGCG CCCTCACGTT CGACCGGATG CTGTCGATCG CCGTGACCCT GGCGGACGTG
GAGCTGACCG ACATCGAGTT CCACACCGTG CCCTGGTACC AGGCGCCCTC CAACCCCAAC
CGGGTCCTGT GGTACGAGGA CCAGGCCGAG GAGCTGTTCA CCGCCGTGCG CGAGGACCGG
CCCCTGCCCC TCACGATGGC CGACGAGGCG CCCGTTCCCC AGGACCCGCC CGGGGCCTCG
CCCTCCCCGG CGGACGAGGA GGTCGCGGAG GCCTCCCCGG ATGACGAGCC CGCCCGTCCG
GGCGTGGGAC GCGACGCCAC CTCCAACCCG TGCTCCGACG GCCTGGGCTA CGGCACCGGG
GACGAGATGG AATAA
 
Protein sequence
MPADEKPSAS GRDHTPDDRP EDGSTPRSGT ENTGDDAAGG DNTTASETGA DGTGRAHESD 
ETEGAEEVGK AEEAPNTGVT SDTGGAQETQ QAQETPEADE PAGAGEAAEA GDTRKSTGTE
GATETEKVTG DEKAVEAEEA TAESAAGSAA GDEERTGTKA PSEAGGAGAP AGAAGTGESG
ETAETAGAAP SAAPVRTKRR RTGRILVWVA ASLVLVLAAG VGTAYGYYRS LRSDMVQYDI
DGLLKEEDRP ERINDSVNIL FMGTDGYEEG STAYSTEFEG ERSDSIMLAH ISPESRVSVI
SFPRDSLVAL PDCDPYGETE GTPGYFGMIN AAMYHGGPPC VVSTIESLSD VRIDHFVHLS
FMSFRDVVDA IGGVDMCIPE PMEDSRSKLD LDAGQQTLDG DEALSFVRAR YEIGDGGDIG
RIDRQQMFLA ALADQVTRND VITDPGRLNA VLRAVAEHSA TDSALTFDRM LSIAVTLADV
ELTDIEFHTV PWYQAPSNPN RVLWYEDQAE ELFTAVREDR PLPLTMADEA PVPQDPPGAS
PSPADEEVAE ASPDDEPARP GVGRDATSNP CSDGLGYGTG DEME