Gene Ndas_2803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2803 
Symbol 
ID9246654 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3348055 
End bp3349380 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content70% 
IMG OID 
Productprotein of unknown function UPF0118 
Protein accessionYP_003680721 
Protein GI297561747 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.168707 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0979616 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAAGCCC AGCGGCCGGG AGTGTGGGCG CTCCTGAACA AGTGGCTCTC CGCACGCAGA 
GCCCGCGCCG AGCGCCTCGC CAGGCTGGAG GCGGAGAACG AGCGCTCCCG TGCCGAGCCC
CAGGCCGCGG ACGAGGGGCC GACCGAGCAG CACCAGGGAG ACGACAACCT CCTGCGGTCC
ATCAGCGACG TGGCCTGGCG GGTACTGCTC ATCGGCGTGG TGGCGGGCCT GCTCGTCTAC
GTGCTCGTCT ACCTGTCGGT CGTCACGCTG CCGGTGATCC TGGCGGTGTT CCTCACCGCC
CTGCTCATGC CGATCGCCAA CGGGCTGCGC CGCAAGGGGC TGGGCAGGGG GCTGTCGACC
ACCATCGCCC TGCTGGTCGG ACTCATCGTC TTCGGCGGCG TGATCTCGCT GATCGTCACG
CCCGCGATCC AGGGCTTCGG TCCGCTGGTG GACAGCGTCA CCAGCGCGAT CACCGAGCTC
CAGGACATCC GGCTGCCCTT CGTCGACCCG GCCCTGTTCA CCGACATGAT CGACGACGCC
TGGGCGCAGA TCCAGAGCAT GATCACCGAG AACCAGGACC AGCTGCTCAG CGGCGCCTGG
ACCGCCACCT CGGCGGTGAT CTCGGTCCTG GTCGGCATCG TCCTGATCAT CGCCCTGACC
GTGTACTTCG TGCACTCGGG CGACCAGCTC ATGGACTGGC TGGTCACCCT GCTGCCGGCC
CGCTCGCGCC CGGGCATGCG CCACGCGGGC GACGTCGCCT ACGGGGTCAT GGGGCGCTAC
GTGCGGGGCG TGGCCGCGGT CGGCTTCTTC GACGCCGTCG GTATCGGTAT CGCCCTGGTC
ATCTTCCTCG ACATCAACCT GGCCATCCCG CTGATCGTGC TGACCTTCGT CGGGGCCTTC
CTGCCGATCA TCGGCGCCTT CCTCACCGGC CTGCTCGCCG CCCTGGTGGC CTTCGTGACC
GAGGGCTGGG TCGTGGCCCT GATCATCGTC GGCGCCGTGC TCCTGGTGCA GCAGCTGGAG
AGCAACGTCT TCGCGCCGCG CATCTACGGC GCCTCGCTCG ACCTGCCCTC GCCGGTCGTG
CTCATCGGGA TCTCCGTCGG CGCGGTCGTC GGCGGTATCC CCGGCATGTT CCTGTCCACC
CCGGTGGTCG CCGTGCTGGC CGCGCTGCTG CGCAACCGCC CGCCCTCCAG CGGTGACGAC
TCCGGCGGAG GGGACGCGGA CGTGGCCGAG GTCAAGGCGG ACACCGTCGT GGTCAGGGCC
GACCAGGGGA CCGGCCAGGA CGCCTCCGGC GGAGCCACCG CGGTCGATCC CCCCGAACAG
AAGTAG
 
Protein sequence
MQAQRPGVWA LLNKWLSARR ARAERLARLE AENERSRAEP QAADEGPTEQ HQGDDNLLRS 
ISDVAWRVLL IGVVAGLLVY VLVYLSVVTL PVILAVFLTA LLMPIANGLR RKGLGRGLST
TIALLVGLIV FGGVISLIVT PAIQGFGPLV DSVTSAITEL QDIRLPFVDP ALFTDMIDDA
WAQIQSMITE NQDQLLSGAW TATSAVISVL VGIVLIIALT VYFVHSGDQL MDWLVTLLPA
RSRPGMRHAG DVAYGVMGRY VRGVAAVGFF DAVGIGIALV IFLDINLAIP LIVLTFVGAF
LPIIGAFLTG LLAALVAFVT EGWVVALIIV GAVLLVQQLE SNVFAPRIYG ASLDLPSPVV
LIGISVGAVV GGIPGMFLST PVVAVLAALL RNRPPSSGDD SGGGDADVAE VKADTVVVRA
DQGTGQDASG GATAVDPPEQ K