Gene Ndas_1419 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1419 
Symbol 
ID9245269 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1739551 
End bp1741086 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content72% 
IMG OID 
Productcell envelope-related transcriptional attenuator 
Protein accessionYP_003679357 
Protein GI297560383 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.327426 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACAGCGG AGGACGATCA GCGGGACGAG GCGCCCGCGC GCCCGGGTAC GGGCCGGGCG 
CTGCTGTGGA CCGCGGCGTC GGTTCCCCTC CCCGGGCTGG CCCACCTGCG GATGCGCCGC
AGGGTCGCGG GCGCGGTGAT CCTGGGGGTC TACCTGGCCG GGATCCTGGG GCTGGTCGTG
TGGGCGTGGC GGCTGGGCGC CGACGAGGCG AACACCATGG CGCGGCTGGC CACCATGGCG
CTCCAGGATC AGTGGCTGCT GGGAGCCATG GGCGTGGTGT TCGTCGTGGC GGTGCTGTGG
CTGACGGTCA TCGTGCACTC GTGGGTGATC ACCCGGCCCG CCGGGGCGCC CCGGAGCTCG
CGGGTGCTCG GCGCGGCCGT GGTGCTGCTG CTGTGCCTGA CCGTCGCGGC GCCCTCGGCG
CTGGCGCTGC ACGGCGGCTA CACGGCCTAC CAGACGCTGA ACAGCGTCTT CCACGCCGAG
GAGGACCCCC TCATGCCCCC GCACGACGAG GCCGACCCCT GGAACGGCCA GGAGCGGGTC
AACGTGCTGC TGATCGGCGC GGACTCCGCG GACAACCGCT ACGGGGTGCG CACCGACAGC
ATGATGGTGG CCAGCATGGA CACCGCGACC GGCGACACCG TGCTGGTGGG ACTGCCGCGC
AACCTGGAGA ACGTGCACTT CCCGGAGGAC AGCGCCCTGG CCGAGCGCTA CCCCGAGCCC
TACGGCTTCG ACCTGCTGCT CAACGACGTG TACCAGACGG TGGCCGAGGA GCCCGAGGAG
CTGGCGCTCA ACCCGGACGC GGCCAACCCC TCCGCCGACA CCCTCAAGAA GGTCATCGGG
TACAACCTCG ACCTGGAGAT CGGCTACTAC GCGATGGTCG ACATGATGGG CTTCCGCGAC
CTGATCGACG CGATCGGCGG CGTGGAGGTG CTCATCGAGG AGCCGATCCC CTACGGCGTG
CACGGCGGGG TGCTGGAGCC GGGCCTGCGC CGCCTCGACG GCCACGACGC GCTCTGGTAC
GGCCGCTCGC GGACCAACAG CGACGACTAC GGCCGGATGG GCCGCCAGGG CTGCCTGATC
AAGTACGTGG CCGAGCAGGT GGACCCGATG ACCGTCCTGA CGAGCTACCG CAGGCTCGCG
GGCGCCACCG AGCGCACCCT GAGCACGGAC ATCCCCCAGG CCAAGGTGCC CGCGTTCGTC
GAACTCGCCG ACAGGGTCAC CGACACCGGG AGCATGAGCA CGTTCCAGCT GTCGCCTCCC
CAGGTCAACA CGGCCAACCC GGACTGGGAG CAGGTCAAGG CACTGGTCGC CGAGGCCATC
ACCGGTGGCG GGCCCGAGGG CGACGACGTG GCCGCCGAGC CCTCCGGCGC CCCCTCCGGG
GAGGAGTCCG CCGCGCCCTC CGAGCCGGCC GAGCCCACGG AGGACGACGG GCTGACCGAG
TGGCAGGAGT ACACGGGCCT CGACGAGGAG GAGCCCGCCG ATCCGGGCCG CCAGGTGGGC
GAGGAGCCCA GCAACCTGGA GGCCCTGTGC CCCTGA
 
Protein sequence
MTAEDDQRDE APARPGTGRA LLWTAASVPL PGLAHLRMRR RVAGAVILGV YLAGILGLVV 
WAWRLGADEA NTMARLATMA LQDQWLLGAM GVVFVVAVLW LTVIVHSWVI TRPAGAPRSS
RVLGAAVVLL LCLTVAAPSA LALHGGYTAY QTLNSVFHAE EDPLMPPHDE ADPWNGQERV
NVLLIGADSA DNRYGVRTDS MMVASMDTAT GDTVLVGLPR NLENVHFPED SALAERYPEP
YGFDLLLNDV YQTVAEEPEE LALNPDAANP SADTLKKVIG YNLDLEIGYY AMVDMMGFRD
LIDAIGGVEV LIEEPIPYGV HGGVLEPGLR RLDGHDALWY GRSRTNSDDY GRMGRQGCLI
KYVAEQVDPM TVLTSYRRLA GATERTLSTD IPQAKVPAFV ELADRVTDTG SMSTFQLSPP
QVNTANPDWE QVKALVAEAI TGGGPEGDDV AAEPSGAPSG EESAAPSEPA EPTEDDGLTE
WQEYTGLDEE EPADPGRQVG EEPSNLEALC P