Gene Ndas_3776 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3776 
Symbol 
ID9247645 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4537858 
End bp4539906 
Gene Length2049 bp 
Protein Length682 aa 
Translation table11 
GC content74% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003681680 
Protein GI297562706 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.382462 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTTGC TCCCCGGACG GTTCCTCGCC CGTGCCACCG CGCTCCCCGC GATCGCCGTC 
TCGGCCTGGC TCCTGGTGGC CTTCCCCCTC CTGGCCCTGG GCTCCCTCAC CCCGCTGACG
GCAGCCGTCC TCGGAACGCC CGCCGTGCTC GCGGCCTGCC TCCTGGTGCC CCGGCTCATT
CCGGACCCGC CCGATGGGGA GGACGCGCCC TGGTGGCCGG TCGCCCTCGT CGTGCTGATC
ACGGTCGCCT TCGCCGCCGT GCAGATCGCC TACCACGCCG AGGCCCTGGT CATCCGCCGC
GACCCCGCCT CCTACGCCAT GTACACGGCG TGGATCGCCG AGAACGGGTT CCTGCCCATC
CCGCAGCAGC GCGAGCTGAT CGCGGGCGAC GACCCCTCGC TCAGCTACCA GAGCCTGGCC
CACTACCAGC GCGGCGACGT CATCTGGCCG CAGTTCATGG CCGGTGCGCC CCTGGTGACC
TCCATCGGCT ACTGGCTGGG CGGGCTCGAC GGGATGCTCG TCACGACGCC GGTCCTGGGC
GCGCTGGGCG TGCTCACCTT CGCCGGGCTC ACCGCGCGCC TGGTCGGGAT CCGCTGGGCG
CCCCTGGCCG CCCTGGTCCT GGCCGTGTGC CTGCCCCAGC AGTGGGTCAG CCGCTTCACC
TACAGCGAGC CGGTCACCCA GATCCTGCTG CTGGGCGGCC TGGTCCTGGC CTACGACGCG
CTCGCGCGGC GCACCCGGAT CACCGACCGG TGGTCGGCCG CGCACACGCT GGCCGCCGTG
GCGGGACTGG CGTTCGGCCT GGGGCTGGTG GTGCGCATCG ACGCGATCCG CGACCTGCTG
CCCGTGGTCG GGTTCGTCGG CCTGCTGCTG CTCGCCCGCC GGGGACAGGC CCTGCCGCTG
CTGGGCGGGC TGGCCGCGGG CGTGGGGCTG GGCCTGTACG CCGGGTACGG GCTGTCCCTG
CCCTACCTGG AGTACCTCTC CGACTCCCTC AACCCCCTGC TGCTCATCAG CGCCGTCGTC
ATCGCCGTCA CCGTCGCGGC CACCGCCGCC CTGTGGCGCC ACGGCGTTCC GCGCGTGGAA
CGCGTGCGCC GGCTGCCCGA CGCGGTGGCC GCGCTCGCCC TGCTCGCCAT GGTGCTGTTC
GCCGTCCGCC CCCTGCTGTG GCCGGACCAC GGGCACGGCA GCGACTTCAC CGACGGCTGG
GTCGCCTACG TCCAGCAGCG CGAGGGACTG CCCGTCGAGG GCAGCCGCAC CTACTACGAC
ATGAGCCTGT ACTGGGTGGG CTGGTACGTG GGCCTGGCCA CGGTCCTGTT CGCCTCGCTC
GGCGTCGCCC ACGTGCTGCG CCGCCTCTTC CAGCGGCGCG ACCCCCAGTG GCTGCTGCCC
GCCATGCTGC TGGTGTGGAC GGTCGGCACC ACCCTGCTGC GGCCCGCCAT CACCCCCGAC
CACCCCTGGG CCAGCAGACG GCTGATCGCC CTGGTCATCC CGGCGTTCAT CCTGTTCGCC
GTCTGGTTCC TGGCCTGGCT CACCCGCTAC TGCGCCGTCG CCGCCCACTC CGGTCGGAGC
ACGGCGGCGG CCCGCGCCCT CCCCGTCGTG GTCGCCGCCA GCGCCTCGGT GGCGCTGATC
GTCCCCACCG CGGGCACGGC GGCCGGGATC ATGGGGTACA GACAGGACGT CGGCACCGTC
GCCGCCACGC ACCGGCTCTG CGCGGCGCTG TCGGAGGACG CCTCGGTGGT CGTGGTCGAC
TCCGACACGG CCGGTAACTA CATGCCGCTG CTGCGCAACG TCTGCGGGGT GCCCACCGCG
TCCATGGACG AGCCGACCCC CGGGGACGTG GACCGCGTGG TCTCCGAGAT CCACGAGCGC
GACCGCGACG CCGTGCTCGC CGCCTCCGAC TGGCAGGTCC TGGAGGACCT CACGGGCGGT
GAGGCCGAGC CCGAGCGCCC CTTCCAGGTG AACGCCCGGA TGGACCCGAG TACCCTCATG
GAACCGCCGA CCGGTTCCTG GGCCTTCAGG GGAAACGTGT GGGTGTCCGT GTTCCCCGAC
GGGGGATGA
 
Protein sequence
MRLLPGRFLA RATALPAIAV SAWLLVAFPL LALGSLTPLT AAVLGTPAVL AACLLVPRLI 
PDPPDGEDAP WWPVALVVLI TVAFAAVQIA YHAEALVIRR DPASYAMYTA WIAENGFLPI
PQQRELIAGD DPSLSYQSLA HYQRGDVIWP QFMAGAPLVT SIGYWLGGLD GMLVTTPVLG
ALGVLTFAGL TARLVGIRWA PLAALVLAVC LPQQWVSRFT YSEPVTQILL LGGLVLAYDA
LARRTRITDR WSAAHTLAAV AGLAFGLGLV VRIDAIRDLL PVVGFVGLLL LARRGQALPL
LGGLAAGVGL GLYAGYGLSL PYLEYLSDSL NPLLLISAVV IAVTVAATAA LWRHGVPRVE
RVRRLPDAVA ALALLAMVLF AVRPLLWPDH GHGSDFTDGW VAYVQQREGL PVEGSRTYYD
MSLYWVGWYV GLATVLFASL GVAHVLRRLF QRRDPQWLLP AMLLVWTVGT TLLRPAITPD
HPWASRRLIA LVIPAFILFA VWFLAWLTRY CAVAAHSGRS TAAARALPVV VAASASVALI
VPTAGTAAGI MGYRQDVGTV AATHRLCAAL SEDASVVVVD SDTAGNYMPL LRNVCGVPTA
SMDEPTPGDV DRVVSEIHER DRDAVLAASD WQVLEDLTGG EAEPERPFQV NARMDPSTLM
EPPTGSWAFR GNVWVSVFPD GG